Tuesday, November 21 , 2017, 3:25 pm | Fair 79º


UCSB’s NCEAS Upgrades Widely Used Scientific Data Repository

New enhancements feature a superfast search function, citations for data and an intuitive user interface

With small labs, field stations and individual researchers collectively producing the majority of the scientific data, the task of storing, sharing and finding the millions of smaller datasets requires a widely available, flexible and robust long-term data management solution. This is especially true now that the National Science Foundation — and a growing number of scientific journals — require authors to openly store and share their research data.

Matt Jones
Matthew Jones

In response, UC Santa Barbara’s National Center for Ecological Analysis and Synthesis (NCEAS) has released a major upgrade to the KNB Data Repository (formerly the Knowledge Network for Biocomplexity). The upgrade improves access to and better supports the data management needs of ecological, environmental and earth science labs and individual researchers.

The repository stores data related to a diverse range of topics, from Influenza A subtypes in wild birds to decadal scale changes in coral reefs in the United States Virgin Islands to 60 years of plankton data from Lake Baikal.

Thousands of individual researchers, dozens of field stations and even large research organizations, such as the Partnership for Interdisciplinary Studies of Coastal Oceans (PISCO) and the Long Term Ecological Research (LTER) Network, use the KNB to collaborate with colleagues and preserve data for the benefit of science.

The major overhaul to the KNB improves data access by making the repository more responsive with an intuitive, multifaceted search interface that is exponentially faster than the previous version. Queries across the entire repository now take less than a second, which makes finding data and potential collaborators faster and easier than ever.

The upgrade enables researchers to assign digital object identifiers (DOIs) to their data, so their work can be cited easily in science journals and credited when other scientists use their data. Designating a DOI is as simple as clicking the publish button, which makes the dataset publicly available and registers the DOI. Researchers can choose to share their data with only a small group of collaborators before releasing the information publicly prior to publication.

The system also features a new user interface and an improved search function, which make the repository easier to use.

“The new KNB interface is very impressive,” said Margaret O’Brien, a research scientist at UCSB’s Marine Science Institute and information manager for the Santa Barbara Coastal LTER project. “Showing how many datasets are available is very useful, and the search filter showing how many datasets to expect will help users tailor their input.”

As one of the founding member nodes of the DataONE network, the KNB Data Repository contributes to the diverse collection of data within the network, ensuring reliable, distributed storage of valuable research data for decades to come. NCEAS researchers Matthew Jones and Mark Schildhauer created the repository in collaboration with the LTER Network.

The KNB is built on the Metacat data repository software system, which is open source and freely available for other research groups to use to deploy their own repositories and link them into the DataONE federation. The KNB Data Repository is available free of charge for researchers, and scientists have stored tens of thousands of datasets on the service.

The NSF originally funded the project in 1998. Funding for enhancements came from NSF, the Andrew W. Mellon Foundation and the Gordon and Betty Moore Foundation.

  • Ask
  • Vote
  • Investigate
  • Answer

Noozhawk Asks: What’s Your Question?

Welcome to Noozhawk Asks, a new feature in which you ask the questions, you help decide what Noozhawk investigates, and you work with us to find the answers.

Here’s how it works: You share your questions with us in the nearby box. In some cases, we may work with you to find the answers. In others, we may ask you to vote on your top choices to help us narrow the scope. And we’ll be regularly asking you for your feedback on a specific issue or topic.

We also expect to work together with the reader who asked the winning questions to find the answer together. Noozhawk’s objective is to come at questions from a place of curiosity and openness, and we believe a transparent collaboration is the key to achieve it.

The results of our investigation will be published here in this Noozhawk Asks section. Once or twice a month, we plan to do a review of what was asked and answered.

Thanks for asking!

Click here to get started >

Support Noozhawk Today

You are an important ally in our mission to deliver clear, objective, high-quality professional news reporting for Santa Barbara, Goleta and the rest of Santa Barbara County. Join the Hawks Club today to help keep Noozhawk soaring.

We offer four membership levels: $5 a month, $10 a month, $25 a month or $1 a week. Payments can be made through PayPal below, or click here for information on recurring credit-card payments.

Thank you for your vital support.

Daily Noozhawk

Subscribe to Noozhawk's A.M. Report, our free e-Bulletin sent out every day at 4:15 a.m. with Noozhawk's top stories, hand-picked by the editors.

Sign Up Now >