ESIP
Browse

Geoweaver: Affordable Workflow Tool for Earth AI Experiments

Download (3.69 MB)
poster
posted on 2022-04-14, 19:15 authored by Ziheng SunZiheng Sun, Annie BurgessAnnie Burgess, Nicoleta Cristea, Daniel Q. Tong, Marshall MaMarshall Ma, Ahmed Alnaim

Earth scientists spend numerous hours on exploring new technologies to learn new knowledge and find critical information from the obtained Earth observations. There are many duplicated efforts happening in every corner of the universities and research institutes, and it is common to find people struggling to solve the technical problems that others might already meet and solve. However, due to disorganization and less sharable format, all these efforts vanished over time and even the authors cannot reproduce them after a period of time. It is not a new problem and there are solutions already (e.g., using professional workflow management software), but requiring users to invest even more time to learn and use them. The barrier and cost are too high to justify their benefits. In light of this problem, we developed a scientist-affordable workflow tool to organize their AI/ML workflows and record all the source code changes and logs of every model run. It is a utility tool which doesn't require much time to learn and they can take the Geoweaver workflow package and run it by themselves even without Geoweaver installed. The entry of barrier is lowered to the level that any people with Python scripting experience can easily use it. The workflow automation, provenance recording, history exportation, are completely under the hood and make scientists focus on the problem solving and information extraction over worrying about their technical debt and potential loss of their experiment history or source code. Using Geoweaver is a long term investment and will make every scientist' work fresh preserved and understandable even years after the original experiments. This presentation was given at the 2022 NASA ESDSWG Meeting (April 19-21, 2022).

Funding

NASA ACCESS#80NSSC21M0028

NSF EAR#1947875, #1947893

NSF OAC#2117834

History

Usage metrics

    Deposit files

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC