fbpx
Wikipedia

National Center for Biotechnology Information

The National Center for Biotechnology Information (NCBI)[1][2] is part of the United States National Library of Medicine (NLM), a branch of the National Institutes of Health (NIH). It is approved and funded by the government of the United States. The NCBI is located in Bethesda, Maryland, and was founded in 1988 through legislation sponsored by US Congressman Claude Pepper.

National Center for
Biotechnology Information
AbbreviationNCBI
Founded1988; 35 years ago (1988)
HeadquartersBethesda, Maryland
Location
Coordinates38°59′45″N 77°05′56″W / 38.9959°N 77.0989°W / 38.9959; -77.0989Coordinates: 38°59′45″N 77°05′56″W / 38.9959°N 77.0989°W / 38.9959; -77.0989
Official language
English
Director
Stephen Sherry
(since Sept 26, 2022)
Parent organization
United States National Library of Medicine
AffiliationsNational Institutes of Health
Websitencbi.nlm.nih.gov

The NCBI houses a series of databases relevant to biotechnology and biomedicine and is an important resource for bioinformatics tools and services. Major databases include GenBank for DNA sequences and PubMed, a bibliographic database for biomedical literature. Other databases include the NCBI Epigenomics database. All these databases are available online through the Entrez search engine. NCBI was directed by David Lipman,[2] one of the original authors of the BLAST sequence alignment program[3] and a widely respected figure in bioinformatics.

GenBank

NCBI had responsibility for making available the GenBank DNA sequence database since 1992.[4] GenBank coordinates with individual laboratories and other sequence databases, such as those of the European Molecular Biology Laboratory (EMBL) and the DNA Data Bank of Japan (DDBJ).[4]

Since 1992, NCBI has grown to provide other databases in addition to GenBank. NCBI provides Gene, Online Mendelian Inheritance in Man, the Molecular Modeling Database (3D protein structures), dbSNP (a database of single-nucleotide polymorphisms), the Reference Sequence Collection, a map of the human genome, and a taxonomy browser, and coordinates with the National Cancer Institute to provide the Cancer Genome Anatomy Project. The NCBI assigns a unique identifier (taxonomy ID number) to each species of organism.[5]

The NCBI has software tools that are available through internet browsers or by FTP. For example, BLAST is a sequence similarity searching program. BLAST can do sequence comparisons against the GenBank DNA database in less than 15 seconds.

NCBI Bookshelf

The NCBI Bookshelf[6] is a collection of freely accessible, downloadable, online versions of selected biomedical books. The Bookshelf covers a wide range of topics including molecular biology, biochemistry, cell biology, genetics, microbiology, disease states from a molecular and cellular point of view, research methods, and virology. Some of the books are online versions of previously published books, while others, such as Coffee Break, are written and edited by NCBI staff. The Bookshelf is a complement to the Entrez PubMed repository of peer-reviewed publication abstracts in that Bookshelf contents provide established perspectives on evolving areas of study and a context in which many disparate individual pieces of reported research can be organized.[citation needed]

Basic Local Alignment Search Tool (BLAST)

BLAST is an algorithm used for calculating sequence similarity between biological sequences, such as nucleotide sequences of DNA and amino acid sequences of proteins.[7] BLAST is a powerful tool for finding sequences similar to the query sequence within the same organism or in different organisms. It searches the query sequence on NCBI databases and servers and posts the results back to the person's browser in the chosen format. Input sequences to the BLAST are mostly in FASTA or GenBank format while output could be delivered in a variety of formats such as HTML, XML formatting, and plain text. HTML is the default output format for NCBI's web-page. Results for NCBI-BLAST are presented in graphical format with all the hits found, a table with sequence identifiers for the hits having scoring related data, along with the alignments for the sequence of interest and the hits received with analogous BLAST scores for these.[8]

Entrez

The Entrez Global Query Cross-Database Search System is used at NCBI for all the major databases such as Nucleotide and Protein Sequences, Protein Structures, PubMed, Taxonomy, Complete Genomes, OMIM, and several others.[9] Entrez is both an indexing and retrieval system having data from various sources for biomedical research. NCBI distributed the first version of Entrez in 1991, composed of nucleotide sequences from PDB and GenBank, protein sequences from SWISS-PROT, translated GenBank, PIR, PRF, PDB, and associated abstracts and citations from PubMed. Entrez is specially designed to integrate the data from several different sources, databases, and formats into a uniform information model and retrieval system which can efficiently retrieve that relevant references, sequences, and structures.[10]

Gene

Gene has been implemented at NCBI to characterize and organize the information about genes. It serves as a major node in the nexus of the genomic map, expression, sequence, protein function, structure, and homology data. A unique GeneID is assigned to each gene record that can be followed through revision cycles. Gene records for known or predicted genes are established here and are demarcated by map positions or nucleotide sequences. Gene has several advantages over its predecessor, LocusLink, including, better integration with other databases in NCBI, broader taxonomic scope, and enhanced options for query and retrieval provided by the Entrez system.[11]

Protein

Protein database maintains the text record for individual protein sequences, derived from many different resources such as NCBI Reference Sequence (RefSeq) project, GenBank, PDB, and UniProtKB/SWISS-Prot. Protein records are present in different formats including FASTA and XML and are linked to other NCBI resources. Protein provides the relevant data to the users such as genes, DNA/RNA sequences, biological pathways, expression and variation data, and literature. It also provides the pre-determined sets of similar and identical proteins for each sequence as computed by the BLAST. The Structure database of NCBI contains 3D coordinate sets for experimentally-determined structures in PDB that are imported by NCBI. The Conserved Domain database (CDD) of protein contains sequence profiles that characterize highly conserved domains within protein sequences. It also has records from external resources like SMART and Pfam. There is another database of proteins known as Protein Clusters database, which contains sets of proteins sequences that are clustered according to the maximum alignments between the individual sequences as calculated by BLAST.[12]

Pubchem database

PubChem database of NCBI is a public resource for molecules and their activities against biological assays. PubChem is searchable and accessible by Entrez information retrieval system.[13]

See also

References

  1. ^ "The Human Genome Project". The New York Times.
  2. ^ a b "Research Institute Posts Gene Data on Internet". The New York Times. June 26, 1997.
  3. ^ . 2000. Archived from the original on 2007-10-07.
  4. ^ a b Mizrachi, Ilene (22 August 2007). GenBank: The Nucleotide Sequence Database. National Center for Biotechnology Information (US) – via www.ncbi.nlm.nih.gov.
  5. ^ "Home - Taxonomy - NCBI". www.ncbi.nlm.nih.gov.
  6. ^ USA (2019-05-06). "Home - Books - NCBI". Ncbi.nlm.nih.gov. Retrieved 2019-06-12.
  7. ^ Altschul Stephen; Gish Warren; Miller Webb; Myers Eugene; Lipman David (1990). "Basic local alignment search tool". Journal of Molecular Biology. 215 (3): 403–410. doi:10.1016/s0022-2836(05)80360-2. PMID 2231712. S2CID 14441902.
  8. ^ Madden T. (2002). The NCBI Handbook, 2nd edition, Chapter 16, The BLAST Sequence Analysis Tool
  9. ^ NCBI Resource Coordinators (2012). "Database resources of the National Center for Biotechnology Information". Nucleic Acids Research 41 (Database issue): D8–D20.
  10. ^ Ostell J. (2002). The NCBI Handbook, 2nd edition, Chapter 15, The Entrez Search and Retrieval System
  11. ^ Maglott D., Pruitt K. & Tatusova T. (2005). The NCBI Handbook, 2nd edition, Chapter 19, Gene: A Directory of Genes
  12. ^ Sayers E. (2013). The NCBI Handbook, 2nd edition, NCBI Protein Resources
  13. ^ Wang Y. & Bryant S H. (2014). The NCBI Handbook, 2nd edition, NCBI PubChem BioAssay Database

External links

  • Official website
  • Official website of the National Library of Medicine]
  • Official website of the National Institutes of Health

national, center, biotechnology, information, ncbi, redirects, here, other, uses, ncbi, disambiguation, ncbi, part, united, states, national, library, medicine, branch, national, institutes, health, approved, funded, government, united, states, ncbi, located, . NCBI redirects here For other uses see NCBI disambiguation The National Center for Biotechnology Information NCBI 1 2 is part of the United States National Library of Medicine NLM a branch of the National Institutes of Health NIH It is approved and funded by the government of the United States The NCBI is located in Bethesda Maryland and was founded in 1988 through legislation sponsored by US Congressman Claude Pepper National Center forBiotechnology InformationAbbreviationNCBIFounded1988 35 years ago 1988 HeadquartersBethesda MarylandLocationUnited StatesCoordinates38 59 45 N 77 05 56 W 38 9959 N 77 0989 W 38 9959 77 0989 Coordinates 38 59 45 N 77 05 56 W 38 9959 N 77 0989 W 38 9959 77 0989Official languageEnglishDirectorStephen Sherry since Sept 26 2022 Parent organizationUnited States National Library of MedicineAffiliationsNational Institutes of HealthWebsitencbi nlm nih govThe NCBI houses a series of databases relevant to biotechnology and biomedicine and is an important resource for bioinformatics tools and services Major databases include GenBank for DNA sequences and PubMed a bibliographic database for biomedical literature Other databases include the NCBI Epigenomics database All these databases are available online through the Entrez search engine NCBI was directed by David Lipman 2 one of the original authors of the BLAST sequence alignment program 3 and a widely respected figure in bioinformatics Contents 1 GenBank 2 NCBI Bookshelf 3 Basic Local Alignment Search Tool BLAST 4 Entrez 5 Gene 6 Protein 7 Pubchem database 8 See also 9 References 10 External linksGenBank EditMain article GenBank NCBI had responsibility for making available the GenBank DNA sequence database since 1992 4 GenBank coordinates with individual laboratories and other sequence databases such as those of the European Molecular Biology Laboratory EMBL and the DNA Data Bank of Japan DDBJ 4 Since 1992 NCBI has grown to provide other databases in addition to GenBank NCBI provides Gene Online Mendelian Inheritance in Man the Molecular Modeling Database 3D protein structures dbSNP a database of single nucleotide polymorphisms the Reference Sequence Collection a map of the human genome and a taxonomy browser and coordinates with the National Cancer Institute to provide the Cancer Genome Anatomy Project The NCBI assigns a unique identifier taxonomy ID number to each species of organism 5 The NCBI has software tools that are available through internet browsers or by FTP For example BLAST is a sequence similarity searching program BLAST can do sequence comparisons against the GenBank DNA database in less than 15 seconds NCBI Bookshelf EditThe NCBI Bookshelf 6 is a collection of freely accessible downloadable online versions of selected biomedical books The Bookshelf covers a wide range of topics including molecular biology biochemistry cell biology genetics microbiology disease states from a molecular and cellular point of view research methods and virology Some of the books are online versions of previously published books while others such as Coffee Break are written and edited by NCBI staff The Bookshelf is a complement to the Entrez PubMed repository of peer reviewed publication abstracts in that Bookshelf contents provide established perspectives on evolving areas of study and a context in which many disparate individual pieces of reported research can be organized citation needed Basic Local Alignment Search Tool BLAST EditBLAST is an algorithm used for calculating sequence similarity between biological sequences such as nucleotide sequences of DNA and amino acid sequences of proteins 7 BLAST is a powerful tool for finding sequences similar to the query sequence within the same organism or in different organisms It searches the query sequence on NCBI databases and servers and posts the results back to the person s browser in the chosen format Input sequences to the BLAST are mostly in FASTA or GenBank format while output could be delivered in a variety of formats such as HTML XML formatting and plain text HTML is the default output format for NCBI s web page Results for NCBI BLAST are presented in graphical format with all the hits found a table with sequence identifiers for the hits having scoring related data along with the alignments for the sequence of interest and the hits received with analogous BLAST scores for these 8 Entrez EditThe Entrez Global Query Cross Database Search System is used at NCBI for all the major databases such as Nucleotide and Protein Sequences Protein Structures PubMed Taxonomy Complete Genomes OMIM and several others 9 Entrez is both an indexing and retrieval system having data from various sources for biomedical research NCBI distributed the first version of Entrez in 1991 composed of nucleotide sequences from PDB and GenBank protein sequences from SWISS PROT translated GenBank PIR PRF PDB and associated abstracts and citations from PubMed Entrez is specially designed to integrate the data from several different sources databases and formats into a uniform information model and retrieval system which can efficiently retrieve that relevant references sequences and structures 10 Gene EditGene has been implemented at NCBI to characterize and organize the information about genes It serves as a major node in the nexus of the genomic map expression sequence protein function structure and homology data A unique GeneID is assigned to each gene record that can be followed through revision cycles Gene records for known or predicted genes are established here and are demarcated by map positions or nucleotide sequences Gene has several advantages over its predecessor LocusLink including better integration with other databases in NCBI broader taxonomic scope and enhanced options for query and retrieval provided by the Entrez system 11 Protein EditProtein database maintains the text record for individual protein sequences derived from many different resources such as NCBI Reference Sequence RefSeq project GenBank PDB and UniProtKB SWISS Prot Protein records are present in different formats including FASTA and XML and are linked to other NCBI resources Protein provides the relevant data to the users such as genes DNA RNA sequences biological pathways expression and variation data and literature It also provides the pre determined sets of similar and identical proteins for each sequence as computed by the BLAST The Structure database of NCBI contains 3D coordinate sets for experimentally determined structures in PDB that are imported by NCBI The Conserved Domain database CDD of protein contains sequence profiles that characterize highly conserved domains within protein sequences It also has records from external resources like SMART and Pfam There is another database of proteins known as Protein Clusters database which contains sets of proteins sequences that are clustered according to the maximum alignments between the individual sequences as calculated by BLAST 12 Pubchem database EditPubChem database of NCBI is a public resource for molecules and their activities against biological assays PubChem is searchable and accessible by Entrez information retrieval system 13 See also EditDNA Data Bank of Japan DDBJ European Bioinformatics Institute EBI References Edit The Human Genome Project The New York Times a b Research Institute Posts Gene Data on Internet The New York Times June 26 1997 Sense from Sequences Stephen F Altschul on Bettering BLAST 2000 Archived from the original on 2007 10 07 a b Mizrachi Ilene 22 August 2007 GenBank The Nucleotide Sequence Database National Center for Biotechnology Information US via www ncbi nlm nih gov Home Taxonomy NCBI www ncbi nlm nih gov USA 2019 05 06 Home Books NCBI Ncbi nlm nih gov Retrieved 2019 06 12 Altschul Stephen Gish Warren Miller Webb Myers Eugene Lipman David 1990 Basic local alignment search tool Journal of Molecular Biology 215 3 403 410 doi 10 1016 s0022 2836 05 80360 2 PMID 2231712 S2CID 14441902 Madden T 2002 The NCBI Handbook 2nd edition Chapter 16 The BLAST Sequence Analysis Tool NCBI Resource Coordinators 2012 Database resources of the National Center for Biotechnology Information Nucleic Acids Research 41 Database issue D8 D20 Ostell J 2002 The NCBI Handbook 2nd edition Chapter 15 The Entrez Search and Retrieval System Maglott D Pruitt K amp Tatusova T 2005 The NCBI Handbook 2nd edition Chapter 19 Gene A Directory of Genes Sayers E 2013 The NCBI Handbook 2nd edition NCBI Protein Resources Wang Y amp Bryant S H 2014 The NCBI Handbook 2nd edition NCBI PubChem BioAssay DatabaseExternal links Edit Wikidata has the property NCBI taxonomy ID P685 see uses Official website Official website of the National Library of Medicine Official website of the National Institutes of Health Retrieved from https en wikipedia org w index php title National Center for Biotechnology Information amp oldid 1139268344, wikipedia, wiki, book, books, library,

article

, read, download, free, free download, mp3, video, mp4, 3gp, jpg, jpeg, gif, png, picture, music, song, movie, book, game, games.