The Genome Sequence DataBase (GSDB): Improving data quality and data access

C Harger; M Skupski; J Bingham; A Farmer; S Hoisie; P Hraber; D Kiphart; L Krakowski; M McLeod; J Schwertfeger; G Seluja; A Siepel; G Singh; D Stamper; P Steadman; N Thayer; R Thompson; P Wargo; M Waugh; JJ Zhuang; Peter Schad

Harger, C., Skupski, M., Bingham, J., Farmer, A., Hoisie, S., Hraber, P., Kiphart, D., Krakowski, L., McLeod, M., Schwertfeger, J., Seluja, G., Siepel, A., Singh, G., Stamper, D., Steadman, P., Thayer, N., Thompson, R., Wargo, P., Waugh, M., ... Schad, P. (1998). The Genome Sequence DataBase (GSDB): Improving data quality and data access. Nucleic Acids Research, 26(1), 21-26. https://doi.org/10.1093/nar/26.1.21

Abstract

In 1997, the primary focus of the Genome Sequence DataBase (GSDB; www.ncgr.org/gsdb), located at the National Center for Genome Resources, was to improve data quality and accessibility. Efforts to increase the quality of data within the database included two major projects: one to identify and remove all vector contamination from sequences in the database, and one to create premier sequence sets (including both alignments and discontiguous sequences). Data accessibility was improved during the course of the last year in several ways. First, a graphical database sequence viewer was made available to researchers. Second, an update process was implemented for the web-based query tool, Maestro. Third, a web-based tool, Excerpt, was developed to retrieve selected regions of any sequence in the database. Lastly, a GSDB flatfile that contains annotation unique to GSDB (e.g., sequence analysis and alignment data) was developed. Additionally, the GSDB web site provides a tool for the detection of matrix attachment regions (MARs), which can be used to identify regions of high coding potential. The ultimate goal of this work is to make GSDB a more useful resource for genomic comparison studies and gene-level studies by improving data quality and by providing data access capabilities that are consistent with the needs of both types of studies.
