paperKB
coga / coga-kb
Help
Sign in

Chunk #11 — ACCESSING THE REFSEQ DATASET

Source
Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation.
Embedded
yes

Text

RefSeq sequence data can be accessed interactively using NCBIs Nucleotide and Protein databases, in BLAST databases, through NCBI's programmatic interface (E-utilities), or through file transfer protocol (FTP). E-utilities support scripted access to download RefSeq data in a variety of formats based on either search terms or accession lists; extensive documentation is available in the NCBI Handbook (www.ncbi.nlm.nih.gov/books/NBK25501/) and training videos are available from NCBI's YouTube channel (https://www.youtube.com/user/NCBINLM). Both the Nucleotide and Protein databases allow for query results to be restricted to only RefSeq records by selecting ‘RefSeq’ under the ‘Source database’ in the filters sidebar. RefSeq data may also be accessed from other NCBI databases including Assembly, BioProject, Gene, and Genome by following the links provided to Nucleotide, Protein, or FTP resources Information on curation changes within the RefSeq group or NCBI updates that impact the RefSeq database are reported through several sources including RefSeq FTP release notes, periodic published reports, the NCBI Announcements News feed http://www.ncbi.nlm.nih.gov/news/ and through the NCBI Insights Blog http://ncbiinsights.ncbi.nlm.nih.gov/. Users may also subscribe to the refseq-announce mail list to receive periodic updates about the project and a summary of the content of each RefSeq FTP release (http://www.ncbi.nlm.nih.gov/mailman/listinfo/refseq-announce/).