A Cyberinfrastructure Model for
Ecological Applications
T. Little, B.
Anderson, P. Cryan, M. Friedl, S. Gauthreaux, J. Kelly, T. Kunz, R. Larkin
January 25, 2006
Many
fundamental questions in environmental sciences cannot be addressed without
coordination between disciplines and access to multiple data sets. One such
question concerns the nature of bird and bat movements at multiple spatial and
temporal scales. Knowledge of how
proximate factors influence these movements promises to inform understanding of
complex relationships among ecological processes, weather patterns, land forms,
and land cover. With greater
understanding of these interactions, it will be possible to provide real-time
observations, predictions, and forecasts of potential adverse affects on animal
populations and health risks to humans.
For example, knowing how extrinsic factors affect migratory events of
birds and bats could be used to inform decision makers and developers of wind
energy facilities on how best to mitigate fatalities, and knowledge of seasonal
migrations would help inform the public health community regarding how to
mitigate the effects of emerging diseases. A
key challenge currently limiting the capacity scientists to develop such
understanding is that available data sets are often incomplete, inaccessible,
lack consistent semantics, or are otherwise difficult to integrate in a manner
that can lead to useful scientific inquiry.
This
proposal describes research to establish an environmental cyberinfrastructure
(CI) prototype that integrates multiple environmental data sets and
demonstrates new analytical capabilities to better understand how multiple
extrinsic factors affect animal movements. The goal of this research is to
create a rich collaborative environment among environmental scientists, domain
specialists, and software developers that will lead to a successful use of
modern CI. Driven by tangible scientific questions, the proposed research team
will create a common lexicon and ontology appropriate for adopting CI
components to better understand animal movements at different temporal and
spatial scales. Software will be developed that exploits existing CI to expose
multiple data sets through a common user and programmer interface that is
amenable to both scientific inquiry and use of existing Grid computing
middleware. The software
infrastructure will provide standard web services interfaces suitable for (a)
mature scientific analysis tools (e.g., R, IDL, ArcGIS, Matlab), (b) composite
applications from other web services (e.g., integration with open source GIS
tools and pop-software such as Google Earth), (c) exposure of data through
dynamically-generated web portals, and (d) the development of stand alone
applications that demonstrate Ôvirtual observatories.Õ The research will focus on data sets
originating at multiple spatial and temporal scales including remotely sensed
data, meteorological observations, NEXRAD Doppler radar, census, and
ground-based animal sensor network systems. We describe several major use cases
for this application, and propose user-centric methodologies for more extensive
design of these use cases. The proposed work will involve the analysis, design,
and adoption of CI software to meet the requirements of use cases established
by a scientific community comprised of geographers, ecologists, biologists, and
meteorologists. Significant
efforts will be committed to identification and dissemination of this
experience for influencing future use of CI in ecological networks.
Broader
Impacts: The PIs will
develop two new courses suitable for education and advancement of young
scientists and engineers in the interdisciplinary domains of terrestrial,
atmospheric, and information technology research. The first will focus on
questions applied to the intersection of the targeted data sets in this
proposal. The second course will be achieved by the creation of a software
development lab in which both undergraduates and project participants will be
exposed to collaborative design and development of robust web services based on
open source CI software created by multiple generations of computer systems
engineers. The enabling of complex queries and analysis of relevant data sets
has the potential to empower researchers to investigate a broad set of environmental
questions related to the data sets beyond North America. Thus, this broader
impact will serve as a model for enabling analysis of other environmental
questions in other geographic regions.
Impacts from this research are also expected on public policy, due to
the application domain, and in the education of future ecologists, biologist,
and engineers. The evolution of IT education towards new CI tools and methods
is relevant to national priorities of global competitiveness, and the education
of a new generation of environmental scientists capable of addressing
increasingly complex questions.