(B) Relational Databases
(C) World Wide Web access to databases
(D) The historical problem
(E) Unifying approaches to link databases
================================================================================
NCBI was Established in 1988 as a national resource for
molecular biology information, NCBI creates
public databases, conducts research in
computational biology, develops software
tools for analyzing genome data, and
disseminates biomedical information - all for
the better understanding of molecular
processes affecting human health and
disease.
The NCBI can be summarised as having 3 arms:
In this database, new protein and nucleic acids sequences are deposited by researchers. These sequences are annotated and placed in the sequence database for access and public viewing. The database can also be searched.
The various Sequence Data Bases and PubMed literature Data Base are linked as shown below B. The Resources at NCBI

Remember that if you wish to improve and make your searches efficient and effective, than you will need to read and understand the documents GenBank records and ENTREZHELP
In addition, the database conatins a set of integrated online analysis bioinformatics tools useful for aligning user input sequences based on rRNA secondary structural constraints and for constructing phylogeny. It is also possible to download sequences in the aligned and unaligned forms. The sequences are in GenBank format.
NOTE: KEGG is part of the GenomeNet and Bioinformatics in Japan and also houses a range of on-line tools such as BLAST, CLUSTAL etc and is worth looking through.
(i) Copy the following "GenBank DNA sequence" associated with NCBI database
(You may need to use Control C to copy)
LOCUS AY078053 1275 bp DNA linear BCT 13-FEB-2002
DEFINITION Corbulabacter subterraneus 16S ribosomal RNA gene, partial
sequence.
ACCESSION AY078053
VERSION AY078053
KEYWORDS .
SOURCE Corbulabacter subterraneus.
ORGANISM Corbulabacter subterraneus
Bacteria; Proteobacteria; alpha subdivision; Rhizobiaceae group;
Beijerinckia group; Corbulabacter.
REFERENCE 1 (bases 1 to 1275)
AUTHORS Patel,B.K.C. and Kanso,S.
TITLE Corbulabacter subterraneus gen. nov. sp. nov., a novel bacterium
from the subsurface Great Artesian Basin of Australia thermal
aquifer
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 1275)
AUTHORS Patel,B.K.C. and Kanso,S.
TITLE Direct Submission
JOURNAL Submitted (06-FEB-2002) Biomolecular and Biomedical Sciences,
Griffith University, Nathan, Brisbane 4111, Australia
FEATURES Location/Qualifiers
source 1..1275
/organism="Corbulabacter subterraneus"
/strain="Fai4"
/db_xref="taxon:186651"
/note="type strain; Fai4; ATCC BAA-295; DSM 14364"
rRNA <1..>1275
/product="16S ribosomal RNA"
BASE COUNT 306 a 305 c 413 g 251 t
ORIGIN
1 atcctggctc agaacgaacg ctggcggcag gcttaacaca tgcaagtcga acgggccctt
61 cggggtcagt ggcagacggg tgagtaacac gtgggaacgt gcccttcagt tcggaataac
121 ccagggaaac ttgggctaat accggatacg cccttttggg gaaagattta tcgctgaagg
181 atcggcccgc gtctgattag ctagttggtg gggtaaaggc tcaccaaggc gacgatcagt
241 agctggtctg agaggatgat cagccacact gggactgaga cacggcccag actcctacgg
301 gaggcagcag tggggaatat tggacaatgg gcgcaagcct gatccagcca tgccgcgtga
361 gtgatgaagg ccttagggtt gtaaagctct ttcggcgggg acgataatga cggtacccgc
421 agaagaagcc ccggctaact tcgtgccagc agccgcggta atacgaaggg ggctagcgtt
481 gttcggaatc actgggcgta aagggcgcgt aggcggcttt gtaagtcggg ggtgaaagcc
541 tgtggctcaa ccacagaatt gccttcggat actgcatggc ttgagaccgg aagaggtaag
601 tggaactgcg agtgtagagg tgaaattcgt agatattcgc aagaacaccc agtggcgaag
661 gcggcttact ggtccggatc tgacgctgag gcgcgaaagc gtggggagca aacaggatta
721 gataccctgg tagtccacgc cgtaaacgat gaatgccaac cgttgggcag cttgctgctc
781 agtggcgcag ctaacgcttt aagcattccg cctggggagt acggtcgcaa aattaaaact
841 caaagaaatt gacgggggcc cgcacaagcg gtggagcatg tggtttaatt cgaagcaacg
901 cgcagaacct taccagcctt tgacatgtcc ggtatggatc ctggagacag gttccttcag
961 ttcggctggc cggaacacag gtgctgcatg gctgtcgtca gctcgtgtcg tgagatgttg
1021 ggttaagtcc cgcaacgagc gcaaccctcg cccttagttg ccatcattca gttgggcact
1081 ctaaggggac tgccggtgat aagccgagag gaaggtgggg atgacgtcaa gtcctcatgg
1141 cccttacggg ctgggctaca cacgtgctac aatggcggtg acaatgggca gcgaacccgc
1201 gagggggagc taatcccaaa aagccgtctc agttcggatt gcactctgca actcgagtgc
1261 atgaaggtgg aatcg
//
(ii) Paste the sequence into the following URL Sequence File Format Converter and / or Readseq, which is a more recent version of the Sequence File Format Converter Program. So, the same program can "visualised" in many different ways without the underlying program changing. NOTE: You may need to use Control V to paste the sequence in the two programs.
(iii) Select "GenBank" from the pull down the menu from the INPUT FORMAT section and select "Fasta" format from the OUTPUT FORMAT pull down menu. Note the differences in the input format and output format.
Different bioinformatics software may require different types of file formats. For example, the Phylip format is required for the PHYLIP suite of software. (NOTE FOR STUDENTS - CAN YOU FIND AN EXAMPLE OF THE PHYLIP Sequenec File Format.