Cyberinfrastructure in an Era Observation and Simulation

subduedjourneySoftware and s/w Development

Oct 28, 2013 (3 years and 11 months ago)

157 views

Oceans Observations

Environmental Obs

Satellites

Earth System Modeling

Cyberinfrastructure in an Era
of

Observation and Simulation

EarthScope

Water

Eva Zanzerkia, NSF GEO/EAR

This Talk


Context at the National and Agency scale


CIF21


Data Management


What this means
for CUAHSI

Framing the Challenge:

Science and Society Transformed by Data


Modern science


Data
-

and compute
-
intensive


Integrative
, multiscale



Multi
-
disciplinary
Collaborations for
Complexity


Individuals, groups, teams
,
communities



Sea of Data


Age of Observation


Distributed, central
repositories, sensor
-

driven,
diverse, etc


NSF ACCI: Innovation
and New
Paradigms


Multiple communities must engage rapidly,
integrate observations, expertise

o
New multidisciplinary research communities and teams

o
Teams notified of events by social networks, mobile


Simulations by global teams


New types of compute, data, collaboration and software
infrastructure


Comprehensive Computational and Data
-
Enabled Science & Engineering (CDS&E)


Fundamental to support 21
st

century science and
engineering

New Budget Thrust for 2012: CIF 21


Broad Principles of CIF21


Comprehensive

and
integrated

cyberinfrastructure

to
transform research, innovation and
education


Focus on computational and data
-
intensive science to
address
complex
problems


Increase
of $117 million over FY 2010
enacted level


Four major components


Caveat
: This is only the president’s budget
request to congress




Discovery

Collaboration

Education

Organizations


Univ ersities, schools


Gov ernment labs, agencies


Research and Medical
Centers


Libraries, Museums


Virtual Organizations


Communities

Expertise


Research and Scholarship


Education


Learning and Workforce
Dev elopment


Interoperability and operations


Cy berscience

Networking



Campus, national, international
networks


Research and experimental networks


End
-
to
-
end throughput


Cy bersecurity

Computational
Resources


Supercomputers


Clouds, Grids, Clusters


Visualization


Compute services


Data Centers

Data



Databases, Data repositories


Collections and Libraries


Data Access; storage,
navigation


management, mining tools,


curation
, privacy

Scientific
Instruments


Large Facilities
,
MREFCs,telescopes,


Colliders,
shake
Tables, laboratory,
field deployment, Sensor
Arrays



Software



Applications, middleware


Software development and
support

Cy bersecurity
: access,


authorization, authentication

New Computational

Resources

Community

Research

Networks

Access and

Connections to


Resources

CIF21: Four Thrust Areas

Data
-
Enabled

Science

Data
-
Enabled Science


Data Services Program
(data)



Provide reliable digital preservation, access, integration, and
analysis capabilities for science and/or engineering data over a
decades
-
long timeline



Data Analysis and Tools Program
(information)


Data mining, manipulation, modeling, visualization, decision
-
making systems



Data
-
intensive Science Program
(knowledge)


Intensive disciplinary efforts, multi
-
disciplinary discovery and
innovation

Observations

Models

Climate

GIS

Remote Sensing

New
Computational Infrastructure


Computational
and data
-
enabled
resources


HPC, Clouds, Clusters, Data Centers



Long
-
term software for science and engineering


Sustained software development and support



Discipline
-
specific activities


Services, tools, compute environments that serve specific
research efforts and communities

Wyoming SCS

Access
and Connectivity


Network connections and engineering program


Real
-
time access to facilities and instruments; Begins to tie in MREFC
activities


Integration and end
-
to
-
end performance to provide seamless access from
researcher to resource



Cybersecurity



from innovation to practice


Deployment of identity management systems


Development of
cybersecurity

prototypes


Community
Research Networks


New multi
-
disciplinary
research communities


Address challenges beyond individuals and disciplinary research
communities


Support and optimize collaboration across small, mid
-
level and large
community networks



Advanced
research on community and social
networks


Structures, leadership, fostering and sustainability


“virtuous cycle” providing feedback through formal evaluation and program
iteration


CIF21


Geosciences

Foci


GEO plans to leverage substantial CI investments


Connect Well
-
established facilities: ES, OOI, IRIS, UNAVCO, IEDA, NCAR, CIG, CUAHSI,etc....


EAR Geoinformatics program


Participation in NSF
-
wide activities: PetaApps, SI2


Partnerships with other agencies and industry


Long
-
term plan: connect resources, science, users through
National

cyberinfrastructure


New and enhanced computational platforms, tools, data centers to analyze, manipulate, visualize
and share large and complex data sets


a
framework for open and easy access of all geoscience data and integrationg with other domains


Infrastructure and technology for retrieving and sharing observational data


Sustained training programs to create a workforce capable of multi
-
disciplinary science



EarthCube


GEO and OCI are working in partnership to support
the development of a geosciences wide
cyberinfrastructure


DCL: nsf11065
http://www.nsf.gov/pubs/2011/nsf11065/nsf11065.j
sp?WT.mc_id=USNSF_179


Webinar July 11, 2011.


Early Fall:
charrette

meeting to bring the geosciences
community together and focus a design


Prototypes (up to 3) will be supported in early Spring

Multiple Modes of Support Are Necessary for
EarthCube

and CIF21



M
odes
of support
” that are essential to
build CIF21
infrastructure and to
engage in CIF21 activities.


Focused grants
to individual PIs
or small groups


Focused programs that are community driven


Small centers


Large national centers


Cyber
-
enhanced field programs


Cyber
-
enhanced observing facilities and MREFC projects


NSF
-
wide initiatives


Education, outreach, and training activities (
EOT)

Modes of
Support

Data
-
Enabled Science

New
Computational
Infrastructure

Access and Connection

Networks

EarthCube

10
-
15 Year Timeline

CIF21 and CUAHSI


CIF21 and
EarthCube

is still being developed. There is the opportunity
to impact implementation through participation
.


CIF21 dovetail with CUAHSI’s Strategic Plan:


Observation and Synthesis


improving accessibility to observational networks


data discovery and community modeling


Data Access


maintain services for diverse data


develop citation and tracking mechanisms


emerging data types and needs


outreach for standards



What are the capabilities that water sciences researchers will need in
the future?

Where can CI Enhance Partnerships?


Connect observing systems


Connect to other domains

and resources:


DataNet; computational facilities; synthesis centers

Data Management Plans

Long
-
standing policy states NSF’s expectations with respect
to sharing of data and other research
products:



Grant Conditions:

“NSF expects investigators to share with
other researchers, at no more than incremental cost and within a
reasonable time, the data, samples, physical collections and other
supporting materials created or gathered in the course of the
work. It also encourages grantees to share software and
inventions or otherwise act to make the innovations they embody
widely useful and
usable.”

Data Management Plan


All
proposals must include
, as a
supplementary doc up to
2 pages, a Data Management Plan.


Plan should describe how the proposal will conform to
NSF policy on dissemination and sharing of research
results.


Plan
will be reviewed as part of the intellectual merit
and/or broader impacts of
the proposal depending on the
proposal intent.


FastLane

will not permit submission of a proposal that is
missing a data management plan.




What’s Included?

o

Anticipated data, samples, physical collections, software,
curriculum materials, anything produced during the reseach.


o

Information on analytical standards, metadata, solutions or
remedies where these do not exist.


o

Plans for data/sample access and sharing, including provision for
protection of privacy/confidentiality/ security, intellectual property,
or other rights, if appropriate.


o

Plans for re
-
use, re
-
distribution, and production of derived data
sets.


o

Plans for archiving/preserving data, samples, and other research
products and providing access to others.

Data Management Plan Resources


http://www.nsf.gov/geo/geo
-
data
-
policies/index.jsp


Data Policies and Community Standards


Templates


Review of DMP is an evolving process. More
guidance on review will be forthcoming.


Opportunities for CUAHSI



CIF21: Integration and Partnerships


Take advantage of resources


Wyoming Supercomputing Center


Powell Center


TeraGrid (XD)


DataNets: DataOne, Data Conservancy


Upcoming Competitions


SI2: NSF 11
-
539 July 18, 2011


EAR Geoinformatics (?): Will be tied to CIF21 and reports from recent
Geoinformatics WS



Data management plans are an opportunity


Set community standards for peer review


Provide resources for a broad community



Articulate your science needs and tools


Be flexible: technology advance outpaces integration into science;


Unanticipated Scientific Discovery is still allowed!