HUMAN GENOME PROJECT (HGP)
- The entire DNA in the haploid set of chromosomes of an oraganism is called a Genome.
- In Human Genome, DNA is packed in 23 chromosomes.
- Human genome contains about 3×10⁹ bp (base pairs).
- Human Genome Project (1990-2003) was the first mega project for the sequencing of nucleotides and mapping of all the genes in human genome.
- HGP was coordinated by U.S. Department of Energy and the National Institute of Health.
Goals of HGP :-
- Identify all the estimated genes in human DNA.
- Sequencing of 3 billion chemical Base pairs of human DNA.
- Store this information in databases.
- Improve tools for data analysis.
- Transfer related technologies to other sectors.
- Address the ethical, legal and social issues (ELSI) that may arise from the project.
METHODOLOGIES :-
It involves 2 major approaches.
1. Expressed Sequence Tags (ESTs) =
- Focused on identifying all the genes that are expressed as RNA.
2.Sequence annotation =
- Sequencing whole set of genome containing all the coding & non-coding sequence and later assigning different regions in the sequence with functions.
PROCEDURE :-
Isolate DNA from a cell
↓
Convert into random fragments
↓
Clone in a host (bacteria & yeast) using vectors (e.g BAC & YAC) for amplification
↓
Sequencing of fragments using Automated DNA sequencers (using Frederick Sanger method)
↓
Arrange the sequences based on overlapping regions
↓
Alignment of sequences using computer programs.
- Sanger has also developed methods for sequencing of amino acids in proteins.
- DNA is converted to random fragments as there are technical limitations in sequencing very long pieces of DNA.
- HGP was closely associated with Bioinformatics.
- Bioinformatics : Application of computer science and information technology to the field of biology & medicine.
- Of the 24 chromosomes (22 autosomes and X & Y), the last sequenced one is chromosome 1 (May 2006).
- Genetic and physical maps on the genome were generated using information on polymorphism of restriction endonuclease recognition sites and some repetitive DNA sequences (microsatellites).
- DNA sequencing also have been done in bacteria, yeast, Coenorhabditis elegans (a free living non-pathogenic nematode), Drosophila, plants (rice & Arabidopsis), etc.
SALIENT FEATURES OF HUMAN GENOME
- Human genome contains 3164.7 million nucleotide bases.
- Total number of genes = about 30,000.
- Average gene consists of 3000 bases, but sizes vary. Largest known human genome (dystrophin on X-chromosome) contains 2.4 million bases.
- 99.9% nucleotide bases are same in all people. Only 0.1% difference makes every individual unique.
- Functions of over 50% of discovered genes are unknown.
- Chromosome 1 has most genes (2968) and Y has the fewest (231).
- Less than 2% of the genome codes for proteins.
- Very large portion of human genome is made of Repeated (repetitive) sequences. These are stretches of DNA sequences that are repeated many times. They have no direct coding functions. They shed light on chromosome structure, dynamics and evolution.
- About 1.4 million locations have single-base DNA differences. They are called SNPs (Single nucleotide polymorphism or 'snips'). This helps to find chromosomal locations for disease-associated sequences and tracing human history.





Comments
Post a Comment