fbpx
Wikipedia

Ecoinformatics

Ecoinformatics, or ecological informatics, is the science of information in ecology and environmental science. It integrates environmental and information sciences to define entities and natural processes with language common to both humans and computers. However, this is a rapidly developing area in ecology and there are alternative perspectives on what constitutes ecoinformatics.

A few definitions have been circulating, mostly centered on the creation of tools to access and analyze natural system data. However, the scope and aims of ecoinformatics are certainly broader than the development of metadata standards to be used in documenting datasets. Ecoinformatics aims to facilitate environmental research and management by developing ways to access, integrate databases of environmental information, and develop new algorithms enabling different environmental datasets to be combined to test ecological hypotheses. Ecoinformatics is related to the concept of ecosystem services.[1]

Ecoinformatics characterize the semantics of natural system knowledge. For this reason, much of today's ecoinformatics research relates to the branch of computer science known as knowledge representation, and active ecoinformatics projects are developing links to activities such as the Semantic Web.

Current initiatives to effectively manage, share, and reuse ecological data are indicative of the increasing importance of fields like ecoinformatics to develop the foundations for effectively managing ecological information. Examples of these initiatives are National Science Foundation Datanet projects, DataONE, Data Conservancy, and Artificial Intelligence for Environment & Sustainability.[1]

Software Development Lifecycle edit

Central to the concept of ecoinformatics is the Software Development Lifecycle (SDLC), a systematic framework for writing, implementing, and maintaining software products. Typically in Ecoinformatics projects, the development pipeline includes data collection, usually from several different environmental data sources, then integrating these data sources together, and then analyzing the data. Here, each step of the SDLC is described in the context of ecoinformatics, per Michener et al.[2] It is important to note that the plan, collect, assure, describes and preserve steps refer to the data collection entity, which can be individual researchers or large data-collection networks, while the discover, integrate, and analyze steps typically refer to the individual researcher.

Plan: Ecoinformatics projects require data from several databases. Each database holds different data, and therefore researchers should identify what types of environmental or ecological data they will need to answer their research question.

Collect: Data is collected in several different ways. In ecoinformatics, this is usually restricted to manually entering data into a spreadsheet, and parsing data from an existing database. The growth of relational databases has made it easier for ecologists to download relevant data and integrate datasets together

Assure: Data entries should be checked thoroughly to validate their accuracy and usability, such as to check for outliers and erroneous points. The same principle applies to data downloaded from datasets. This responsibility falls on both the ecologist downloading the data, and the entity that sets up the data collection system.

Describe: An accurate description of the metadata of a dataset that is used in a study should include enough information to deduce the data collection and processing methodology, when the data were collected, why the data were collected, and how the data were stored. This is important for reproducibility, especially for projects that build on each other and may recycle data

Preserve: After data is collected by an institutional entity, it should be archived such that it is easily accessible. Ideally, this is in databases that are maintained and not at risk of deprecation

Discover: While there are good practices for discovering data to start a research project, this process is often marred by a lack of usable, published data, as researchers may collect data specific to their study, but may not publish this data for wider use. On the data collection end, this can be addressed by better data-sharing practices, such as by linking datasets when publishing papers or studies. On the data procurement end, this can be addressed by more precise data searching, such as using key words to find relevant datasets.

Integrate: Synthesizing datasets together can be difficult and labor-intensive, largely due to the methodological differences in data collection. There are several approaches to this, but the best practices typically involve computational approaches, namely using R or Python, to automate the processes and prevent errors

Analyze: Data analysis can take several forms, and should be tailored to the specific ecological project. However, all data analysis methods should be well-documented, including the procedure for analysis, justification for analysis methods, and any shortcomings in a specific approach.

Applications of Ecoinformatics Across Ecology edit

Ecosystem Ecology[3] edit

Ecosystem studies, by definition, encompass interactions across the entire life sciences spectrum, from microscopic biochemical reactions to large-scale geological phenomena. As a result, big databases may not be designed specifically for any particular research question, but should be inclusive enough to support most studies. Since ecosystem-level questions require a broad perspective, data-related ecosystem projects would likely incorporate data from several databases.

A common framework for incorporating data into ecosystem-level studies is the network science model, in which data collection mechanisms and resources are treated like a large, interconnected network instead of individual entities. The network may include several data collection stations within one databases, or may span across multiple databases. Currently there are several large-scale networks, but they do not generate data on the scale to consider ecology as a big data science.

A current challenge for ecoinformatics in ecosystem ecology is that most funding is prioritized for generating new data rather than maintaining existing data infrastructures. Integrating data across the different spatial scales can also be difficult, since each dataset may hold different types of data.

Urban Ecology[4] edit

The current push for smart cities, and sensor network integration into infrastructure, has positioned as a major source of data for ecological studies. Typical urban ecology questions address the effects of urbanization on the local ecosystem, and how to drive future development to promote urban biodiversity.

While sensor networks in cities typically collect environmental data to optimize city processes, they may also be used for ecological initiatives, especially with respect to understanding the complex, multi-layered relationship between cities and their local ecosystem. It can also be used to better understand the current landscape of cities, and identify avenues for rewinding of cities. For example, analyzing mobility patterns can identify areas that may lend themselves well to building parks and green spaces. Bird watching data can also be used to identify the types of bird species in a local area.

Infectious Disease[5] edit

Like other disciplines of ecology, emerging infectious disease and epidemiology span multiple scales, from understanding the genetics that drive disease trends to large-scale spatiotemporal analyses. As a result, infectious disease studies can incorporate everything from bioinformatics, genetic sequences, amino acid sequences, and environmental observation data.

On the micro-scale, these data can then be used to predict infectivity/transmissibility, drug resistance, drug candidates, and mutation sites. On the macro-scale, it can be used to identify societal trends or environmental factors that lend themselves to spillover, locations of infection, and practices that cause disease transmission.

Databases[6] edit

  • USGS National Streamflow sensor network
  • GBIF
  • Neotoma
  • Paleobiology database
  • European Vegetation Archive
  • USDA Forest Inventory Analysis
  • TRY
  • BIEN
  • AmeriFlux
  • TEAM
  • iNaturalist
  • NEON
  • GLEON
  • LTER
  • CZO
  • TERN
  • SAEON

References edit

  1. ^ a b Villa, Ferdinando; Ceroni, Marta; Bagstad, Ken; Johnson, Gary; Krivov, Sergey (2009-01-01). "ARIES (ARtificial Intelligence for Ecosystem Services): A new tool for ecosystem services assessment, planning, and valuation". ResearchGate. Retrieved 2022-01-23.
  2. ^ Michener, William K.; Jones, Matthew B. (February 2012). "Ecoinformatics: supporting ecology as a data-intensive science". Trends in Ecology & Evolution. 27 (2): 85–93. doi:10.1016/j.tree.2011.11.016. ISSN 0169-5347. PMID 22240191. S2CID 12268743.
  3. ^ LaDeau, S. L.; Han, B. A.; Rosi-Marshall, E. J.; Weathers, K. C. (2017-03-01). "The Next Decade of Big Data in Ecosystem Science". Ecosystems. 20 (2): 274–283. Bibcode:2017Ecosy..20..274L. doi:10.1007/s10021-016-0075-y. ISSN 1435-0629.
  4. ^ Yang, Jun (2020-10-01). "Big data and the future of urban ecology: From the concept to results". Science China Earth Sciences. 63 (10): 1443–1456. Bibcode:2020ScChD..63.1443Y. doi:10.1007/s11430-020-9666-3. ISSN 1869-1897. S2CID 221285047.
  5. ^ Kasson, Peter M. (2020-07-20). "Infectious Disease Research in the Era of Big Data". Annual Review of Biomedical Data Science. 3 (1): 43–59. doi:10.1146/annurev-biodatasci-121219-025722. ISSN 2574-3414.
  6. ^ Farley, Scott S; Dawson, Andria; Goring, Simon J; Williams, John W (2018-07-18). "Situating Ecology as a Big-Data Science: Current Advances, Challenges, and Solutions". BioScience. 68 (8): 563–576. doi:10.1093/biosci/biy068. ISSN 0006-3568.

External links edit

  • ecoinformatics.org, Online Resource for Managing Ecological Data and Information
  • , Research links and public wiki for discussion.
  • , Ecosystem Informatics at Oregon State University
  • industrial Environmental Informatics, Industrial Environmental Informatics at HTW-Berlin, University of Applied Sciences
  • , Ecoinformatics at the University of Ottawa, Canada
  • Ecoinformatics program at the National Center for Ecological Analysis & Synthesis 2014-11-05 at the Wayback Machine
  • Ecological Informatics: An International Journal on Computational Ecology and Ecological Data Science
  • Ecological Data
  • NSF DataNet call for proposals
  • DataONE
  • Data Conservancy
  • [1], EcoInformatics Summer Institute, an NSF-funded REU site (Research Experience for Undergraduates)

ecoinformatics, this, article, needs, additional, citations, verification, please, help, improve, this, article, adding, citations, reliable, sources, unsourced, material, challenged, removed, find, sources, news, newspapers, books, scholar, jstor, october, 20. This article needs additional citations for verification Please help improve this article by adding citations to reliable sources Unsourced material may be challenged and removed Find sources Ecoinformatics news newspapers books scholar JSTOR October 2015 Learn how and when to remove this template message Ecoinformatics or ecological informatics is the science of information in ecology and environmental science It integrates environmental and information sciences to define entities and natural processes with language common to both humans and computers However this is a rapidly developing area in ecology and there are alternative perspectives on what constitutes ecoinformatics A few definitions have been circulating mostly centered on the creation of tools to access and analyze natural system data However the scope and aims of ecoinformatics are certainly broader than the development of metadata standards to be used in documenting datasets Ecoinformatics aims to facilitate environmental research and management by developing ways to access integrate databases of environmental information and develop new algorithms enabling different environmental datasets to be combined to test ecological hypotheses Ecoinformatics is related to the concept of ecosystem services 1 Ecoinformatics characterize the semantics of natural system knowledge For this reason much of today s ecoinformatics research relates to the branch of computer science known as knowledge representation and active ecoinformatics projects are developing links to activities such as the Semantic Web Current initiatives to effectively manage share and reuse ecological data are indicative of the increasing importance of fields like ecoinformatics to develop the foundations for effectively managing ecological information Examples of these initiatives are National Science Foundation Datanet projects DataONE Data Conservancy and Artificial Intelligence for Environment amp Sustainability 1 Contents 1 Software Development Lifecycle 2 Applications of Ecoinformatics Across Ecology 2 1 Ecosystem Ecology 3 2 2 Urban Ecology 4 2 3 Infectious Disease 5 3 Databases 6 4 References 5 External linksSoftware Development Lifecycle editCentral to the concept of ecoinformatics is the Software Development Lifecycle SDLC a systematic framework for writing implementing and maintaining software products Typically in Ecoinformatics projects the development pipeline includes data collection usually from several different environmental data sources then integrating these data sources together and then analyzing the data Here each step of the SDLC is described in the context of ecoinformatics per Michener et al 2 It is important to note that the plan collect assure describes and preserve steps refer to the data collection entity which can be individual researchers or large data collection networks while the discover integrate and analyze steps typically refer to the individual researcher Plan Ecoinformatics projects require data from several databases Each database holds different data and therefore researchers should identify what types of environmental or ecological data they will need to answer their research question Collect Data is collected in several different ways In ecoinformatics this is usually restricted to manually entering data into a spreadsheet and parsing data from an existing database The growth of relational databases has made it easier for ecologists to download relevant data and integrate datasets togetherAssure Data entries should be checked thoroughly to validate their accuracy and usability such as to check for outliers and erroneous points The same principle applies to data downloaded from datasets This responsibility falls on both the ecologist downloading the data and the entity that sets up the data collection system Describe An accurate description of the metadata of a dataset that is used in a study should include enough information to deduce the data collection and processing methodology when the data were collected why the data were collected and how the data were stored This is important for reproducibility especially for projects that build on each other and may recycle dataPreserve After data is collected by an institutional entity it should be archived such that it is easily accessible Ideally this is in databases that are maintained and not at risk of deprecationDiscover While there are good practices for discovering data to start a research project this process is often marred by a lack of usable published data as researchers may collect data specific to their study but may not publish this data for wider use On the data collection end this can be addressed by better data sharing practices such as by linking datasets when publishing papers or studies On the data procurement end this can be addressed by more precise data searching such as using key words to find relevant datasets Integrate Synthesizing datasets together can be difficult and labor intensive largely due to the methodological differences in data collection There are several approaches to this but the best practices typically involve computational approaches namely using R or Python to automate the processes and prevent errorsAnalyze Data analysis can take several forms and should be tailored to the specific ecological project However all data analysis methods should be well documented including the procedure for analysis justification for analysis methods and any shortcomings in a specific approach Applications of Ecoinformatics Across Ecology editEcosystem Ecology 3 edit Ecosystem studies by definition encompass interactions across the entire life sciences spectrum from microscopic biochemical reactions to large scale geological phenomena As a result big databases may not be designed specifically for any particular research question but should be inclusive enough to support most studies Since ecosystem level questions require a broad perspective data related ecosystem projects would likely incorporate data from several databases A common framework for incorporating data into ecosystem level studies is the network science model in which data collection mechanisms and resources are treated like a large interconnected network instead of individual entities The network may include several data collection stations within one databases or may span across multiple databases Currently there are several large scale networks but they do not generate data on the scale to consider ecology as a big data science A current challenge for ecoinformatics in ecosystem ecology is that most funding is prioritized for generating new data rather than maintaining existing data infrastructures Integrating data across the different spatial scales can also be difficult since each dataset may hold different types of data Urban Ecology 4 edit The current push for smart cities and sensor network integration into infrastructure has positioned as a major source of data for ecological studies Typical urban ecology questions address the effects of urbanization on the local ecosystem and how to drive future development to promote urban biodiversity While sensor networks in cities typically collect environmental data to optimize city processes they may also be used for ecological initiatives especially with respect to understanding the complex multi layered relationship between cities and their local ecosystem It can also be used to better understand the current landscape of cities and identify avenues for rewinding of cities For example analyzing mobility patterns can identify areas that may lend themselves well to building parks and green spaces Bird watching data can also be used to identify the types of bird species in a local area Infectious Disease 5 edit Like other disciplines of ecology emerging infectious disease and epidemiology span multiple scales from understanding the genetics that drive disease trends to large scale spatiotemporal analyses As a result infectious disease studies can incorporate everything from bioinformatics genetic sequences amino acid sequences and environmental observation data On the micro scale these data can then be used to predict infectivity transmissibility drug resistance drug candidates and mutation sites On the macro scale it can be used to identify societal trends or environmental factors that lend themselves to spillover locations of infection and practices that cause disease transmission Databases 6 editUSGS National Streamflow sensor network GBIF Neotoma Paleobiology database European Vegetation Archive USDA Forest Inventory Analysis TRY BIEN AmeriFlux TEAM iNaturalist NEON GLEON LTER CZO TERN SAEONReferences edit a b Villa Ferdinando Ceroni Marta Bagstad Ken Johnson Gary Krivov Sergey 2009 01 01 ARIES ARtificial Intelligence for Ecosystem Services A new tool for ecosystem services assessment planning and valuation ResearchGate Retrieved 2022 01 23 Michener William K Jones Matthew B February 2012 Ecoinformatics supporting ecology as a data intensive science Trends in Ecology amp Evolution 27 2 85 93 doi 10 1016 j tree 2011 11 016 ISSN 0169 5347 PMID 22240191 S2CID 12268743 LaDeau S L Han B A Rosi Marshall E J Weathers K C 2017 03 01 The Next Decade of Big Data in Ecosystem Science Ecosystems 20 2 274 283 Bibcode 2017Ecosy 20 274L doi 10 1007 s10021 016 0075 y ISSN 1435 0629 Yang Jun 2020 10 01 Big data and the future of urban ecology From the concept to results Science China Earth Sciences 63 10 1443 1456 Bibcode 2020ScChD 63 1443Y doi 10 1007 s11430 020 9666 3 ISSN 1869 1897 S2CID 221285047 Kasson Peter M 2020 07 20 Infectious Disease Research in the Era of Big Data Annual Review of Biomedical Data Science 3 1 43 59 doi 10 1146 annurev biodatasci 121219 025722 ISSN 2574 3414 Farley Scott S Dawson Andria Goring Simon J Williams John W 2018 07 18 Situating Ecology as a Big Data Science Current Advances Challenges and Solutions BioScience 68 8 563 576 doi 10 1093 biosci biy068 ISSN 0006 3568 External links editecoinformatics org Online Resource for Managing Ecological Data and Information Ecoinformatics Collaboratory Research links and public wiki for discussion Ecoinformatics Education Ecosystem Informatics at Oregon State University industrial Environmental Informatics Industrial Environmental Informatics at HTW Berlin University of Applied Sciences International Society for Ecological Informatics Canadian Facility for Ecoinformatics Research Ecoinformatics at the University of Ottawa Canada Ecoinformatics program at the National Center for Ecological Analysis amp Synthesis Archived 2014 11 05 at the Wayback Machine Ecological Informatics An International Journal on Computational Ecology and Ecological Data Science Ecological Data NSF DataNet call for proposals DataONE Data Conservancy 1 EcoInformatics Summer Institute an NSF funded REU site Research Experience for Undergraduates Retrieved from https en wikipedia org w index php title Ecoinformatics amp oldid 1209662443, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.