在加州大學(xué)伯克力分校進(jìn)行的兩項(xiàng)新研究中,,研究人員發(fā)現(xiàn)基因分布在不同種類(lèi)中的情況異常混亂,,而且原始生物之間分享基因的情況更明顯,,甚至往往共享的不只是幾個(gè)基因,而是整個(gè)基因群,。這兩項(xiàng)研究的論文分別發(fā)表在3月7日的《自然》雜志和近期的《PNAS》上,。
基因在不同物種間流動(dòng)的這種現(xiàn)象就是所謂的基因水平轉(zhuǎn)移(橫向移動(dòng))。之前的研究顯示,,在細(xì)菌中經(jīng)??梢杂^察到這種現(xiàn)象促使致病細(xì)菌可以將抗藥性基因與其它細(xì)菌交換共享,從而導(dǎo)致細(xì)菌抗藥性的快速蔓延,。
此前,,研究人員發(fā)現(xiàn)二種不同種類(lèi)的植物也會(huì)分享基因,但是卻不清楚這種現(xiàn)象有多普遍以及是如何發(fā)生的,。
發(fā)表在PNAS的研究顯示,,加州大學(xué)伯克力分校和勞倫斯伯克力國(guó)家實(shí)驗(yàn)室(LBNL)的研究人員分析了超過(guò)8,000種不同的基因家族,這些基因編碼的蛋白質(zhì)可代表所有生物中數(shù)以百萬(wàn)計(jì)的蛋白質(zhì),。研究人員希望通過(guò)這項(xiàng)浩大的分析研究來(lái)評(píng)估基因水平轉(zhuǎn)移的發(fā)生率,。
他們發(fā)現(xiàn)一半以上最原始的有機(jī)生物——古菌(Archaea),有一個(gè)以上的蛋白質(zhì)基因是通過(guò)基因水平轉(zhuǎn)移途徑獲得的,;而細(xì)菌通過(guò)這種途徑獲得基因的百分率則為30%到50%,;真核生物則只有10%的幾率。
第二項(xiàng)研究發(fā)表于3月7日Nature網(wǎng)絡(luò)版上,。這項(xiàng)研究中,,研究人員發(fā)現(xiàn)了生活在加州酸性紅泥土中的二種細(xì)菌分享了一大群基因。這些基因編碼的蛋白質(zhì)具有合作的功能,,因此研究人員認(rèn)為,,這種特性可以幫助細(xì)菌適應(yīng)同樣類(lèi)型的新環(huán)境。此外,,這項(xiàng)研究也是第一次觀察到一大群基因發(fā)生基因水平轉(zhuǎn)移的現(xiàn)象,。
部分英文全文:
Published online before print March 7, 2007, 10.1073/pnas.0611557104
PNAS | March 13, 2007 | vol. 104 | no. 11 | 4489-4494
Global extent of horizontal gene transfer
In-Geol Choi*, and Sung-Hou Kim
Physical Biosciences Division, Lawrence Berkeley National Laboratory and Department of Chemistry, University of California, Berkeley, CA 94720
Contributed by Sung-Hou Kim, December 28, 2006 (received for review July 1, 2006)
Abstract
Horizontal gene transfer (HGT) is thought to play an important role in the evolution of species and innovation of genomes. There have been many convincing evidences for HGT for specific genes or gene families, but there has been no estimate of the global extent of HGT. Here, we present a method of identifying HGT events within a given protein family and estimate the global extent of HGT in all curated protein domain families (8,000) listed in the Pfam database. The results suggest four conclusions: (i) for all protein domain families in Pfam, the fixation of genes horizontally transferred is not a rampant phenomenon between organisms with substantial phylogenetic separations (1.1–9.7% of Pfam families surveyed at three taxonomic ranges studied show indication of HGT); (ii) however, at the level of domains, >50% of Archaea have one or more protein domains acquired by HGT, and nearly 30–50% of Bacteria did the same when examined at three taxonomic ranges. But, the equivalent value for Eukarya is <10%; (iii) HGT will have very little impact in the construction of organism phylogeny, when the construction methods use whole genomes, large numbers of common genes, or SSU rRNAs; and (iv) there appears to be no strong preference of HGT for protein families of particular cellular or molecular functions.
protein domain family | protein sequence family | lateral gene transfer
One of the new important concepts that emerged from a large number of genomic sequences in the last decade is that of horizontal gene transfer (HGT): gene transfer among organisms of different species. HGT has been found to have occurred in all three domains: Archaea, Bacteria, and Eukarya. The concept of HGT has been evoked to interpret various evolutionary processes ranging from speciation and the adaptation of organisms to uncertainties in phylogenetic inference of the tree of life (1–9). Although HGT has been regarded as a driving force in the innovation and evolution of genomes, especially in prokaryotes, its extent and impact on the evolutionary process and phylogeny of organisms or species remains controversial (8–10).
There have been several methods developed to detect HGT, including (i) difference between gene trees derived from a limited number of gene families and the reference trees such as the small-subunit ribosomal RNA (SSU rRNA) tree (11–13) or whole genome tree (14); (ii) unexpectedly high sequence similarity of a gene from two distant genomes compared with those among homologous genes in closely related genomes (15); and (iii) unusual nucleotide composition or codon usages of a gene compared with the rest of the genes within a genome (16, 17). Many factors affect the detection of HGT, such as lineage-specific gene loss (18, 19), unequal rates of base substitution (1), loss of signal due to amelioration processes (16), and others (1, 15).
It has been suggested that HGT may have been "rampant" in primitive genomes (6, 20), but, for modern organisms, it may not be a dominant factor in speciation, because HGT has less effect on overall genome phylogeny (10, 21).
There have been many convincing evidences for HGT for specific genes or gene families, but there has been no estimate of the global extent of HGT in terms of protein domains. Here, we present a statistical method to identify the member(s) in a protein family that may have joined the family by HGT events and examine the global extent of HGT events for all protein domain families of known curated sequences at various ranges of taxonomic levels.
A protein (sequence) domain is a functionally independent unit in protein sequence. The gene coding for it often behaves like a modular genetic element that transfers within or between genomes, sometimes forming a new gene coding for a multiple domain protein (22–24). Because the fixation of a new gene during evolution depends mostly on its advantage for survival, we focus on HGT of the genetic module coding for the sequence domains, rather than the entire genes. At present, there are 1.2 million curated protein domain sequences from three domains of life (Archaea, Bacteria, and Eukarya) in the Pfam (release 16.0) (25).
英文全文鏈接:http://www.pnas.org/cgi/content/full/104/11/4489
Nature advance online publication 7 March 2007 | doi:10.1038/nature05690; Received 7 November 2006; Accepted 20 February 2007; Published online 7 March 2007
Genome-wide analysis of genetic alterations in acute lymphoblastic leukaemia
Charles G. Mullighan1,6, Salil Goorha1,6, Ina Radtke1, Christopher B. Miller1, Elaine Coustan-Smith2, James D. Dalton1, Kevin Girtman1, Susan Mathew1,7, Jing Ma5, Stanley B. Pounds3, Xiaoping Su5, Ching-Hon Pui2, Mary V. Relling4, William E. Evans4, Sheila A. Shurtleff1 and James R. Downing1
Departments of Pathology,
Oncology,
Biostatistics,
Pharmaceutical Sciences, and the
Hartwell Center for Bioinformatics and Biotechnology, St Jude Children's Research Hospital, Memphis, Tennessee 38105, USA
These authors contributed equally to this work.
Present address: The Department of Pathology & Laboratory Medicine, New York Presbyterian Hospital, Cornell Campus, 525 East 68th Street, F511, New York, New York 10021, USA.
Correspondence to: James R. Downing1 Correspondence and requests for materials should be addressed to J.R.D. (Email: [email protected]).
Abstract
Chromosomal aberrations are a hallmark of acute lymphoblastic leukaemia (ALL) but alone fail to induce leukaemia. To identify cooperating oncogenic lesions, we performed a genome-wide analysis of leukaemic cells from 242 paediatric ALL patients using high-resolution, single-nucleotide polymorphism arrays and genomic DNA sequencing. Our analyses revealed deletion, amplification, point mutation and structural rearrangement in genes encoding principal regulators of B lymphocyte development and differentiation in 40% of B-progenitor ALL cases. The PAX5 gene was the most frequent target of somatic mutation, being altered in 31.7% of cases. The identified PAX5 mutations resulted in reduced levels of PAX5 protein or the generation of hypomorphic alleles. Deletions were also detected in TCF3 (also known as E2A), EBF1, LEF1, IKZF1 (IKAROS) and IKZF3 (AIOLOS). These findings suggest that direct disruption of pathways controlling B-cell development and differentiation contributes to B-progenitor ALL pathogenesis. Moreover, these data demonstrate the power of high-resolution, genome-wide approaches to identify new molecular lesions in cancer.