Information Services

Paper: Program Description Abstract

Design of a Data Catalog to Support Discovery, Acquisition, and Use of Data Sets in a Secure Research Environment

Sunday, May 5
5:20 PM - 5:35 PM
Room: Columbus GH (East Tower, Ballroom/Gold Level)

Background: Since 2015, our medical school’s library has supported a secure computational environment, known as the Data Core, which allows researchers to work collaboratively on projects involving sensitive research data. As of October 2018, the Data Core supports 20 researchers spanning 38 active projects and a total of 41 datasets governed by data use agreements (DUAs) with external agencies, including the Centers for Medicare and Medicaid Services. Data Core users have recently expressed interest in a searchable online system that would make relevant research data sets discoverable and provide information on data governance, acquisition, and use.
Description: In response to these conversations, we have begun developing a data catalog with online search functionality. To create a data dictionary for the catalog, we first selected applicable terms from the Dublin Core Metadata Initiative (DCMI) and the Data Catalog Vocabulary (DCAT), an RDF vocabulary designed to facilitate interoperability between data catalogs published on the Web. We supplemented this list with data elements reflecting institution-specific data governance considerations, for example whether an additional attestation must be submitted for each new user, and the names of other principal investigators who have previously used a given DUA.
Conclusion: We are in the process of populating the Data Catalog with metadata on existing internal and external data sets and DUAs, and are developing a structured plan to make the Data Catalog searchable both through queries on indexed fields and through full-text search of the metadata.
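
As an illustration only, a catalog record and the two search modes described above might be sketched as follows. This is a minimal sketch under stated assumptions, not the Data Core's actual schema or search implementation: the class name, field names, helper functions, and sample records are all hypothetical. Only the mapping of the first four fields to dcterms:title, dcterms:description, dcterms:publisher, and dcat:keyword reflects terms that exist in DCMI and DCAT.

```python
from dataclasses import dataclass, field, fields


@dataclass
class CatalogRecord:
    # Descriptive fields, loosely mapped to standard vocabulary terms.
    title: str          # dcterms:title
    description: str    # dcterms:description
    publisher: str      # dcterms:publisher
    keywords: list      # dcat:keyword
    # Hypothetical institution-specific governance fields (assumed names).
    attestation_required_per_user: bool = False
    prior_dua_investigators: list = field(default_factory=list)


def _values(record, name):
    """Return the lowercased string value(s) of one field as a list."""
    value = getattr(record, name)
    items = value if isinstance(value, list) else [value]
    return [str(item).lower() for item in items]


def search_by_field(records, name, value):
    """Field search: case-insensitive exact match on one named field."""
    needle = value.lower()
    return [r for r in records if needle in _values(r, name)]


def full_text_search(records, query):
    """Full-text search: case-insensitive substring match over all fields."""
    needle = query.lower()
    hits = []
    for r in records:
        text = " ".join(v for f in fields(r) for v in _values(r, f.name))
        if needle in text:
            hits.append(r)
    return hits


# Entirely made-up sample records, for illustration only.
records = [
    CatalogRecord(
        title="Example claims data set",
        description="Hypothetical claims extract governed by a DUA",
        publisher="Example Agency",
        keywords=["claims", "utilization"],
        attestation_required_per_user=True,
        prior_dua_investigators=["Investigator A"],
    ),
    CatalogRecord(
        title="Example survey data set",
        description="Hypothetical survey responses on clinic utilization",
        publisher="Example Institute",
        keywords=["survey"],
    ),
]

for hit in search_by_field(records, "keywords", "survey"):
    print("field match:", hit.title)
for hit in full_text_search(records, "utilization"):
    print("full-text match:", hit.title)
```

In this sketch the field search scans every record; a production catalog would more plausibly delegate both modes to a database or search engine index.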

Michael Eliot Bales

Research Impact and Evaluation Informationist
Weill Cornell Medicine
New York, New York

Michael Bales is a Research Impact and Evaluation Informationist at the Samuel J. Wood Library at Weill Cornell Medicine (WCM). A PhD graduate of the Department of Biomedical Informatics at Columbia University, Michael has authored articles on scientific social networking, geographic information systems, and autism genomics. Prior to his doctoral studies he obtained a Master of Public Health in Epidemiology at the University of Minnesota, then completed a Public Health Informatics Fellowship at the U.S. Centers for Disease Control and Prevention. His work at WCM focuses on publication data management, research impact assessment, and informatics policy. Michael also serves as an informationist in the Clinical and Translational Science Center, a WCM multi-institutional consortium.

Peter Robert Oxley

Associate Director of Research Services
Weill Cornell Medicine
New York, New York

Peter Oxley, PhD, is the Associate Director of Research Services at the Samuel J. Wood Library, Weill Cornell Medicine, New York. His primary responsibilities are to manage the Data Core, the library bioinformatics service, and the scientific software hub. The Data Core is a secure enclave computing environment for storing and analyzing research data, including protected health information. The Data Core is also developing a data catalog to manage data discoverability, accessibility, and governance. The bioinformatics service provides workshops, seminars, and consultations on bioinformatics and data science analysis to staff, faculty, and students. The scientific software hub is a centralized software information and license request service for the medical college.

Terrie R. Wheeler

Director
Weill Cornell Medicine
New York, NY
