Pseudogene

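Pseudogenes (the prefix pseudo- means "false") are a class of gene segments found on chromosomes. The sequence of a pseudogene usually resembles that of a corresponding gene, but it has lost at least part of its function: for example, the gene cannot be expressed, or the protein it encodes is non-functional.^[3]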

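Pseudogenes are generally thought to have originated as genes whose function was not essential to the organism's survival. As mutations accumulated, changes such as premature stop codons in the coding region and frameshift mutations arose, and the gene gradually became a non-functional pseudogene. Copy-number variation (CNV) can also produce pseudogenes. In copy-number variation, DNA segments of 1 kb (kilobase) or more are duplicated or deleted.^[4] Some pseudogenes have neither introns nor promoters (such pseudogenes are thought to have been transferred onto the chromosome through reverse transcription of mRNA and are called "processed pseudogenes"),^[5] but other pseudogenes still retain features shared with normal genes, such as promoter-associated CpG islands and RNA splice sites.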

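The term "pseudogene" was first proposed by Jacq et al. in 1977.^[6] For a long time biologists regarded pseudogenes as functionless junk DNA; however, research in recent years has shown that pseudogenes, like other non-coding sequences, can regulate gene expression. This regulatory role is important for maintaining the physiological activity of the organism, and some pseudogenes also play an important part in the development of certain diseases.^[7]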

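In evolutionary biology, sequence analysis of pseudogenes that lost their function during evolution is particularly valuable and has long been a way for researchers to reconstruct evolutionary history. Pseudogenes generally retain some features of their source genes. According to evolutionary theory, two closely related species share a common ancestor. Aligning and analyzing pseudogene sequences can therefore test whether two species share a common ancestor and can be used to estimate when the two species began to diverge (with a resolution on the order of a million years).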
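The divergence-time estimate mentioned above can be sketched with a simple molecular-clock calculation. This is a minimal illustration rather than the method of any cited study: it assumes that pseudogene sites evolve neutrally at a constant rate, applies the Jukes-Cantor correction for multiple substitutions at the same site, and uses hypothetical values for the observed difference and the substitution rate.

```python
import math

def jukes_cantor_distance(p: float) -> float:
    """Convert the observed proportion of differing sites p into an
    estimated number of substitutions per site (Jukes-Cantor model)."""
    if not 0.0 <= p < 0.75:
        raise ValueError("p must be in [0, 0.75) for the Jukes-Cantor correction")
    return -0.75 * math.log(1.0 - (4.0 / 3.0) * p)

def divergence_time_years(p: float, rate_per_site_per_year: float) -> float:
    """Estimate the time since two lineages split as T = K / (2 * rate).
    The factor 2 accounts for substitutions accumulating on both lineages."""
    k = jukes_cantor_distance(p)
    return k / (2.0 * rate_per_site_per_year)

# Hypothetical inputs, for illustration only.
observed_difference = 0.02  # 2% of aligned pseudogene sites differ
assumed_rate = 1e-9         # substitutions per site per year (assumed)

t = divergence_time_years(observed_difference, assumed_rate)
print(f"Estimated divergence: {t / 1e6:.1f} million years ago")
```

Because the result scales inversely with the assumed rate, the uncertainty in that rate largely determines the uncertainty in the estimated date.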

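Pseudogenes are usually characterized by a combination of homology to a known gene and loss of some functionality. That is, although every pseudogene has a DNA sequence similar to that of some functional gene, it is usually unable to produce a functional final protein product. Pseudogenes are sometimes difficult to identify and characterize in genomes, because the two requirements of homology and loss of function are usually inferred from sequence alignments rather than demonstrated biologically.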

Homology is implied by sequence identity between the DNA sequences of the pseudogene and parent gene. After aligning the two sequences, the percentage of identical base pairs is computed. A high sequence identity means that it is highly likely that these two sequences diverged from a common ancestral sequence (are homologous), and highly unlikely that these two sequences have evolved independently (see Convergent evolution).
Nonfunctionality can manifest itself in many ways. Normally, a gene must go through several steps to a fully functional protein: Transcription, pre-mRNA processing, translation, and protein folding are all required parts of this process. If any of these steps fails, then the sequence may be considered nonfunctional. In high-throughput pseudogene identification, the most commonly identified disablements are premature stop codons and frameshifts, which almost universally prevent the translation of a functional protein product.
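The percent-identity calculation described above can be sketched as follows. The function assumes the two sequences have already been aligned (with `-` marking gaps) and skips any column that contains a gap, which is only one of several conventions in use; the example sequences are made up.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of ungapped alignment columns with identical bases."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue  # skip columns where either sequence has a gap
        compared += 1
        if a == b:
            identical += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * identical / compared

# Made-up aligned fragments of a parent gene and a candidate pseudogene.
parent = "ATGGCCAAGTTGACC"
pseudogene = "ATGGCTAAG-TGACC"
print(f"{percent_identity(parent, pseudogene):.1f}% identity")
```

Here 13 of the 14 ungapped columns match. Real pipelines typically report identity together with alignment coverage, since a short, perfectly matching stretch is weaker evidence of homology than a long one.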

Pseudogenes for RNA genes are usually more difficult to discover as they do not need to be translated and thus do not have "reading frames".

Pseudogenes can complicate molecular genetic studies. For example, amplification of a gene by PCR may simultaneously amplify a pseudogene that shares similar sequences. This is known as PCR bias or amplification bias. Similarly, pseudogenes are sometimes annotated as genes in genome sequences.

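Processed pseudogenes often cause problems for gene prediction programs, which frequently misidentify them as real genes or exons. It has been proposed that identifying processed pseudogenes can help improve the accuracy of gene prediction methods.^[8]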

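A 2014 study of the human proteome showed that 140 human pseudogenes can be translated,^[9] but the function of their protein products remains unclear.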

Based on their mechanisms of origin and their characteristics, pseudogenes can be broadly divided into four classes: processed pseudogenes, non-processed pseudogenes, unitary pseudogenes, and pseudo-pseudogenes.

Processed

Processed (or retrotransposed) pseudogenes. In higher eukaryotes, particularly mammals, retrotransposition is a fairly common event that has had a huge impact on the composition of the genome. For example, somewhere between 30–44% of the human genome consists of repetitive elements such as SINEs and LINEs (see retrotransposons).^[10]^[11] In the process of retrotransposition, a portion of the mRNA or hnRNA transcript of a gene is spontaneously reverse transcribed back into DNA and inserted into chromosomal DNA. Although retrotransposons usually create copies of themselves, it has been shown in an in vitro system that they can create retrotransposed copies of random genes, too.^[12] Once these pseudogenes are inserted back into the genome, they usually contain a poly-A tail, and usually have had their introns spliced out; these are both hallmark features of cDNAs. However, because they are derived from an RNA product, processed pseudogenes also lack the upstream promoters of normal genes; thus, they are considered "dead on arrival", becoming non-functional pseudogenes immediately upon the retrotransposition event.^[13] However, these insertions occasionally contribute exons to existing genes, usually via alternatively spliced transcripts.^[14] A further characteristic of processed pseudogenes is common truncation of the 5' end relative to the parent sequence, which is a result of the relatively non-processive retrotransposition mechanism that creates processed pseudogenes.^[15] Processed pseudogenes are continually being created in primates.^[16] Human populations, for example, differ in the sets of processed pseudogenes carried by their individuals.^[17]
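The hallmarks listed above (a poly-A tail, missing introns, no upstream promoter) can be collected with a small heuristic. This is an illustrative sketch only: the function names, the thresholds `min_run` and `window`, and the two boolean inputs (which in practice would come from aligning the candidate to its parent gene) are all assumptions and are not taken from any published pipeline.

```python
import re
from typing import List

def has_poly_a_tail(sequence: str, min_run: int = 10, window: int = 30) -> bool:
    """True if the last `window` bases contain a run of at least `min_run` A's.
    Both thresholds are arbitrary illustrative choices."""
    tail = sequence.upper()[-window:]
    return re.search(r"A{%d,}" % min_run, tail) is not None

def processed_hallmarks(sequence: str,
                        has_introns: bool,
                        has_upstream_promoter: bool) -> List[str]:
    """List which hallmark features of a processed pseudogene are present."""
    found = []
    if has_poly_a_tail(sequence):
        found.append("poly-A tail")
    if not has_introns:
        found.append("no introns")
    if not has_upstream_promoter:
        found.append("no upstream promoter")
    return found

# Made-up candidate locus ending in a run of adenines.
candidate = "GATTACAGATTACAGATTACA" + "A" * 15
print(processed_hallmarks(candidate, has_introns=False, has_upstream_promoter=False))
```

A candidate showing all three features is consistent with a retrotransposed origin, but the features are individually weak evidence; poly-A tails, for instance, degrade as mutations accumulate.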

Non-processed

Non-processed (or duplicated) pseudogenes. Gene duplication is another common and important process in the evolution of genomes. A copy of a functional gene may arise as a result of a gene duplication event caused by homologous recombination at, for example, repetitive SINE sequences on misaligned chromosomes and subsequently acquire mutations that cause the copy to lose the original gene's function. Duplicated pseudogenes usually have all the same characteristics as genes, including an intact exon-intron structure and regulatory sequences. The loss of a duplicated gene's functionality usually has little effect on an organism's fitness, since an intact functional copy still exists. According to some evolutionary models, shared duplicated pseudogenes indicate the evolutionary relatedness of humans and the other primates.^[18] If pseudogenization is due to gene duplication, it usually occurs in the first few million years after the gene duplication, provided the gene has not been subjected to any selection pressure.^[19] Gene duplication generates functional redundancy and it is not normally advantageous to carry two identical genes. Mutations that disrupt either the structure or the function of either of the two genes are not deleterious and will not be removed through the selection process. As a result, the gene that has been mutated gradually becomes a pseudogene and will be either unexpressed or functionless. This kind of evolutionary fate is shown by population genetic modeling^[20]^[21] and also by genome analysis.^[19]^[22] Depending on the evolutionary context, these pseudogenes will either be deleted or become so distinct from the parental genes that they are no longer identifiable. Relatively young pseudogenes can be recognized due to their sequence similarity.^[23]

Unitary pseudogenes

Various mutations (such as indels and nonsense mutations) can prevent a gene from being normally transcribed or translated, and thus the gene may become less- or non-functional or "deactivated". These are the same mechanisms by which non-processed genes become pseudogenes, but the difference in this case is that the gene was not duplicated before pseudogenization. Normally, such a pseudogene would be unlikely to become fixed in a population, but various population effects, such as genetic drift, a population bottleneck, or, in some cases, natural selection, can lead to fixation. The classic example of a unitary pseudogene is the gene that presumably coded the enzyme L-gulono-γ-lactone oxidase (GULO) in primates. In all mammals studied besides primates (except guinea pigs), GULO aids in the biosynthesis of ascorbic acid (vitamin C), but it exists as a disabled gene (GULOP) in humans and other primates.^[24]^[25] Another more recent example of a disabled gene links the deactivation of the caspase 12 gene (through a nonsense mutation) to positive selection in humans.^[26]

It has been shown that processed pseudogenes accumulate mutations faster than non-processed pseudogenes.^[27]

Pseudo-pseudogenes

The rapid proliferation of DNA sequencing technologies has led to the identification of many apparent pseudogenes using gene prediction techniques. Pseudogenes are often identified by the appearance of a premature stop codon in a predicted mRNA sequence, which would, in theory, prevent synthesis (translation) of the normal protein product of the original gene. There have been some reports of translational readthrough of such premature stop codons in mammals, as reviewed in the "Translational readthrough" section of the stop codon article. A small amount of the protein product of such readthrough may still be recognizable and function at some level. If so, the pseudogene can be subject to natural selection. That appears to have happened during the evolution of Drosophila species, as described next.
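The in silico criterion described above, a premature stop codon in the predicted coding sequence, can be sketched as a simple in-frame scan. The example also shows how a single-base deletion (a frameshift) can bring a stop codon into frame. The sequences are made up, and the function assumes the input starts at the first base of the start codon.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def has_premature_stop(cds: str) -> bool:
    """True if an in-frame stop codon occurs before the last complete codon."""
    seq = cds.upper()
    n_codons = len(seq) // 3
    for i in range(n_codons - 1):  # the final complete codon is excluded
        if seq[3 * i:3 * i + 3] in STOP_CODONS:
            return True
    return False

# Made-up intact coding sequence: ATG GCG CTA AGG ACC TAA
intact = "ATGGCGCTAAGGACCTAA"

# Deleting one base shifts the reading frame: ATG GCC TAA GGA CCT (AA)
frameshifted = intact[:5] + intact[6:]

print(has_premature_stop(intact))        # no stop before the final codon
print(has_premature_stop(frameshifted))  # TAA now appears in frame as the third codon
```

As the pseudo-pseudogene examples in this article show, a positive result from such a scan suggests, but does not prove, that no functional protein is made.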

In 2016 it was reported that 4 predicted pseudogenes in multiple Drosophila species actually encode proteins with biologically important functions,^[28] "suggesting that such 'pseudo-pseudogenes' could represent a widespread phenomenon". For example, the functional protein (an olfactory receptor) is found only in neurons. This finding of tissue-specific biologically-functional genes that could have been dismissed as pseudogenes by in silico analysis complicates the analysis of sequence data. As of 2012, it appeared that there are approximately 12,000–14,000 pseudogenes in the human genome,^[29] almost comparable to the oft-cited approximate value of 20,000 genes in our genome. The current work may also help to explain why we are able to live with 20 to 100 putative homozygous loss of function mutations in our genomes.^[30]

Through reanalysis of over 50 million peptides generated from the human proteome and separated by mass spectrometry, it now (2016) appears that there are at least 19,262 human proteins produced from 16,271 genes or clusters of genes. From this analysis, 8 new protein coding genes that were previously considered pseudogenes were identified.^[31]

Examples of pseudogene function

The term "pseudo-pseudogene" was coined in the publication that investigated the gene in the chemosensory ionotropic glutamate receptor Ir75a of Drosophila sechellia, which bears a premature termination codon (PTC) and was thus classified as a pseudogene based on that in silico analysis. However, in vivo the D. sechellia Ir75a locus produces a functional receptor, owing to translational read-through of the PTC. Read-through is detected only in neurons and depends on the nucleotide sequence downstream of the PTC.^[28]
The Drosophila jingwei gene produces a functional alcohol dehydrogenase enzyme in vivo.^[32] However, previous in silico analysis classified it as a processed pseudogene.^[33] The evolution of this gene has been discussed.^[34]
A human processed pseudogene of phosphoglycerate mutase was initially reported by interpretation of both in silico and experimental evidence.^[35] That pseudogene was investigated more fully by another group, which found convincing evidence that it was a functional gene,^[36] which is now named PGAM4. The gene is expressed in the testes and polymorphisms in that gene appear to account for about 5% of cases of male infertility.^[37]
siRNAs. Some endogenous siRNAs appear to be derived from pseudogenes, and thus some pseudogenes play a role in regulating protein-coding transcripts, as reviewed.^[38] One of the many examples is psiPPM1K. Processing of RNAs transcribed from psiPPM1K yields siRNAs that can act to suppress the most common type of liver cancer, hepatocellular carcinoma.^[39] This and much other research has led to considerable excitement about the possibility of targeting pseudogenes with/as therapeutic agents.^[40]
Some piRNAs are derived from pseudogenes located in piRNA clusters.^[41] Those piRNAs regulate genes via the piRNA pathway in mammalian testes and are crucial for limiting transposable element damage to the genome.^[42]
Figure: BRAF pseudogene acts as a ceRNA.
There are many reports of pseudogene transcripts acting as microRNA decoys. Perhaps the earliest definitive example of such a pseudogene involved in cancer is the pseudogene of BRAF. The BRAF gene is a proto-oncogene that, when mutated, is associated with many cancers. Normally, the amount of BRAF protein is kept under control in cells through the action of miRNA. In normal situations, RNA from BRAF and RNA from the pseudogene BRAFP1 compete for miRNA, but the balance of the 2 RNAs is such that cells grow normally. However, when BRAFP1 RNA expression is increased (either experimentally or by natural mutations), less miRNA is available to control the expression of BRAF, and the increased amount of BRAF protein causes cancer.^[43] This sort of competition for regulatory elements by RNAs that are endogenous to the genome has given rise to the term competitive endogenous RNA (ceRNA).
The PTEN gene is a known tumor suppressor gene. The PTEN pseudogene, PTENP1, is a processed pseudogene that is very similar in its genetic sequence to the wild-type gene. However, PTENP1 has a missense mutation which eliminates the codon for the initiating methionine and thus prevents translation of the normal PTEN protein.^[44] In spite of that, PTENP1 appears to play a role in oncogenesis. Because of its similarity to the PTEN gene, the 3' UTR of PTENP1 mRNA binds the same microRNAs and so acts as a decoy for PTEN mRNA; overexpression of the 3' UTR resulted in an increase of PTEN protein level.^[45] That is, overexpression of the PTENP1 3' UTR leads to increased regulation and suppression of cancerous tumors. The biology of this system is basically the inverse of the BRAF system described above.
Pseudogenes can, over evolutionary time scales, participate in gene conversion and other mutational events that may give rise to new or newly-functional genes. This has led to the concept, used in a major review from 2003, that pseudogenes could be viewed as potogenes: potential genes for evolutionary diversification.^[46]
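The miRNA competition described in the BRAF/BRAFP1 and PTEN/PTENP1 examples above can be illustrated with a toy calculation. This is a deliberately simplified sketch, not a model taken from the cited studies: it assumes a fixed miRNA pool that binds gene and pseudogene transcripts with equal affinity, in proportion to their abundance, and that each bound miRNA silences one transcript. All quantities are arbitrary units.

```python
def mirna_on_gene(mirna_total: float, gene_rna: float, pseudogene_rna: float) -> float:
    """Share of the miRNA pool bound to gene transcripts, assuming
    equal-affinity binding proportional to transcript abundance."""
    total_targets = gene_rna + pseudogene_rna
    if total_targets <= 0:
        raise ValueError("at least one target transcript is required")
    return mirna_total * gene_rna / total_targets

def unrepressed_gene_rna(mirna_total: float, gene_rna: float, pseudogene_rna: float) -> float:
    """Gene transcripts left free of miRNA (one miRNA silences one transcript)."""
    bound = min(gene_rna, mirna_on_gene(mirna_total, gene_rna, pseudogene_rna))
    return gene_rna - bound

# Arbitrary illustrative amounts: same gene RNA and miRNA, more pseudogene RNA.
baseline = unrepressed_gene_rna(mirna_total=100, gene_rna=100, pseudogene_rna=100)
overexpressed = unrepressed_gene_rna(mirna_total=100, gene_rna=100, pseudogene_rna=300)
print(baseline, overexpressed)
```

In this toy setting, raising the pseudogene RNA level leaves more gene transcripts free of miRNA. That matches the direction of the effect in both examples: more BRAF protein (an oncogene product) when BRAFP1 increases, and more PTEN protein (a tumor suppressor) when the PTENP1 3' UTR is overexpressed.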

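Pseudogenes are also found in bacterial genomes.^[47] Bacteria that carry pseudogenes are usually symbionts or intracellular parasites, so they do not need some of the genes required by bacteria living in complex external environments. An extreme example is the genome of Mycobacterium leprae, the causative agent of leprosy, which has been reported to contain 1,133 pseudogenes that account for about 50% of its transcriptome.^[48]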

[1]
Max EE. Plagiarized Errors and Molecular Genetics. Creation Evolution Journal. 1986, 6 (3): 34–46 [2017-10-19]. (Archived from the original on 2019-03-01).
[2]
Chandrasekaran C, Betrán E. Origins of new genes and pseudogenes. Nature Education. 2008, 1 (1): 181 [2017-10-19]. (Archived from the original on 2020-11-22).
[3]
Vanin EF. Processed pseudogenes: characteristics and evolution. Annual Review of Genetics. 1985, 19: 253–72. PMID 3909943. doi:10.1146/annurev.ge.19.120185.001345.
[4]
Chang Y, Stuart A, et al. Antigen presenting genes and genomic copy number variations in the Tasmanian devil MHC. BMC Genomics. 2012, 13:87. doi:10.1186/1471-2164-13-87.
[5]
Herron JC, Freeman S. Evolutionary analysis 4th. Upper Saddle River, NJ: Pearson Prentice Hall. 2007. ISBN 978-0-13-227584-2.
[6]
Jacq C, Miller JR, Brownlee GG. A pseudogene structure in 5S DNA of Xenopus laevis. Cell. September 1977, 12 (1): 109–20. PMID 561661. doi:10.1016/0092-8674(77)90189-1.
[7]
Xiao-Jie L, Ai-Mei G, Li-Juan J, Jiang X. Pseudogene in cancer: real functions and promising signature. Journal of Medical Genetics. January 2015, 52 (1): 17–24. PMID 25391452. doi:10.1136/jmedgenet-2014-102785.
[8]
van Baren MJ, Brent MR. Iterative gene prediction and pseudogene removal improves genome annotation. Genome Research. May 2006, 16 (5): 678–85. PMC 1457044 . PMID 16651666. doi:10.1101/gr.4766206.
[9]
Kim MS, et al. A draft map of the human proteome. Nature. 2014, 509: 575–581. PMC 4403737 . PMID 24870542. doi:10.1038/nature13302.
[10]
Jurka J. Evolutionary impact of human Alu repetitive elements. Current Opinion in Genetics & Development. December 2004, 14 (6): 603–8. PMID 15531153. doi:10.1016/j.gde.2004.08.008.
[11]
Dewannieux M, Heidmann T. LINEs, SINEs and processed pseudogenes: parasitic strategies for genome modeling. Cytogenetic and Genome Research. 2005, 110 (1–4): 35–48. PMID 16093656. doi:10.1159/000084936.
[12]
Dewannieux M, Esnault C, Heidmann T. LINE-mediated retrotransposition of marked Alu sequences. Nature Genetics. September 2003, 35 (1): 41–8. PMID 12897783. doi:10.1038/ng1223.
[13]
Graur D, Shuali Y, Li WH. Deletions in processed pseudogenes accumulate faster in rodents than in humans. Journal of Molecular Evolution. April 1989, 28 (4): 279–85. PMID 2499684. doi:10.1007/BF02103423.
[14]
Baertsch R, Diekhans M, Kent WJ, Haussler D, Brosius J. Retrocopy contributions to the evolution of the human genome. BMC Genomics. October 2008, 9: 466. PMC 2584115 . PMID 18842134. doi:10.1186/1471-2164-9-466.
[15]
Pavlícek A, Paces J, Zíka R, Hejnar J. Length distribution of long interspersed nucleotide elements (LINEs) and processed pseudogenes of human endogenous retroviruses: implications for retrotransposition and pseudogene detection. Gene. October 2002, 300 (1–2): 189–94. PMID 12468100. doi:10.1016/S0378-1119(02)01047-8.
[16]
Navarro FC, Galante PA. A Genome-Wide Landscape of Retrocopies in Primate Genomes. Genome Biology and Evolution. July 2015, 7 (8): 2265–75. PMC 4558860 . PMID 26224704. doi:10.1093/gbe/evv142.
[17]
Schrider DR, Navarro FC, Galante PA, Parmigiani RB, Camargo AA, Hahn MW, de Souza SJ. Gene copy-number polymorphism caused by retrotransposition in humans. PLoS Genetics. 2013-01-24, 9 (1): e1003242. PMC 3554589 . PMID 23359205. doi:10.1371/journal.pgen.1003242.
[18]
Max EE. Plagiarized Errors and Molecular Genetics. TalkOrigins Archive. 2003-05-05 [2008-07-22]. (Archived from the original on 2020-11-12).
[19]
Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. November 2000, 290 (5494): 1151–5. Bibcode:2000Sci...290.1151L. PMID 11073452. doi:10.1126/science.290.5494.1151.
[20]
Walsh JB. How often do duplicated genes evolve new functions?. Genetics. January 1995, 139 (1): 421–8. PMC 1206338 . PMID 7705642.
[21]
Lynch M, O'Hely M, Walsh B, Force A. The probability of preservation of a newly arisen gene duplicate. Genetics. December 2001, 159 (4): 1789–804. PMC 1461922 . PMID 11779815.
[22]
Harrison PM, Hegyi H, Balasubramanian S, Luscombe NM, Bertone P, Echols N, Johnson T, Gerstein M. Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22. Genome Research. February 2002, 12 (2): 272–80. PMC 155275 . PMID 11827946. doi:10.1101/gr.207102.
[23]
Zhang J. Evolution by gene duplication: an update. Trends in Ecology and Evolution. 2003, 18 (6): 292–298. doi:10.1016/S0169-5347(03)00033-8.
[24]
Nishikimi M, Kawai T, Yagi K. Guinea pigs possess a highly mutated gene for L-gulono-gamma-lactone oxidase, the key enzyme for L-ascorbic acid biosynthesis missing in this species. The Journal of Biological Chemistry. October 1992, 267 (30): 21967–72. PMID 1400507.
[25]
Nishikimi M, Fukuyama R, Minoshima S, Shimizu N, Yagi K. Cloning and chromosomal mapping of the human nonfunctional gene for L-gulono-gamma-lactone oxidase, the enzyme for L-ascorbic acid biosynthesis missing in man. The Journal of Biological Chemistry. May 1994, 269 (18): 13685–8. PMID 8175804.
[26]
Xue Y, Daly A, Yngvadottir B, Liu M, Coop G, Kim Y, Sabeti P, Chen Y, Stalker J, Huckle E, Burton J, Leonard S, Rogers J, Tyler-Smith C. Spread of an inactive form of caspase-12 in humans is due to recent positive selection. American Journal of Human Genetics. April 2006, 78 (4): 659–70. PMC 1424700 . PMID 16532395. doi:10.1086/503116.
[27]
Zheng D, Frankish A, Baertsch R, Kapranov P, Reymond A, Choo SW, Lu Y, Denoeud F, Antonarakis SE, Snyder M, Ruan Y, Wei CL, Gingeras TR, Guigó R, Harrow J, Gerstein MB. Pseudogenes in the ENCODE regions: consensus annotation, analysis of transcription, and evolution. Genome Research. June 2007, 17 (6): 839–51. PMC 1891343 . PMID 17568002. doi:10.1101/gr.5586307.
[28]
Prieto-Godino LL, Rytz R, Bargeton B, Abuin L, Arguello JR, Peraro MD, Benton R. Olfactory receptor pseudo-pseudogenes. Nature. November 2016, 539 (7627): 93–97. PMC 5164928 . PMID 27776356. doi:10.1038/nature19824.
[29]
Pei B, Sisu C, Frankish A, Howald C, Habegger L, Mu XJ, Harte R, Balasubramanian S, Tanzer A, Diekhans M, Reymond A, Hubbard TJ, Harrow J, Gerstein MB. The GENCODE pseudogene resource. Genome Biology. September 2012, 13 (9): R51. PMC 3491395 . PMID 22951037. doi:10.1186/gb-2012-13-9-r51.
[30]
MacArthur DG, Balasubramanian S, Frankish A, Huang N, Morris J, Walter K, et al. A systematic survey of loss-of-function variants in human protein-coding genes. Science. February 2012, 335 (6070): 823–8. PMC 3299548 . PMID 22344438. doi:10.1126/science.1215040.
[31]
Wright JC, Mudge J, Weisser H, Barzine MP, Gonzalez JM, Brazma A, Choudhary JS, Harrow J. Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow. Nature Communications. June 2016, 7: 11778. PMC 4895710 . PMID 27250503. doi:10.1038/ncomms11778.
[32]
Long M, Langley CH. Natural selection and the origin of jingwei, a chimeric processed functional gene in Drosophila. Science. April 1993, 260 (5104): 91–5. Bibcode:1993Sci...260...91L. PMID 7682012. doi:10.1126/science.7682012.
[33]
Jeffs P, Ashburner M. Processed pseudogenes in Drosophila. Proceedings. Biological Sciences. May 1991, 244 (1310): 151–9. PMID 1679549. doi:10.1098/rspb.1991.0064.
[34]
Wang W, Zhang J, Alvarez C, Llopart A, Long M. The origin of the Jingwei gene and the complex modular structure of its parental gene, yellow emperor, in Drosophila melanogaster. Molecular Biology and Evolution. September 2000, 17 (9): 1294–301. PMID 10958846. doi:10.1093/oxfordjournals.molbev.a026413.
[35]
Dierick HA, Mercer JF, Glover TW. A phosphoglycerate mutase brain isoform (PGAM 1) pseudogene is localized within the human Menkes disease gene (ATP7 A). Gene. October 1997, 198 (1–2): 37–41. PMID 9370262. doi:10.1016/s0378-1119(97)00289-8.
[36]
Betrán E, Wang W, Jin L, Long M. Evolution of the phosphoglycerate mutase processed gene in human and chimpanzee revealing the origin of a new primate gene. Molecular Biology and Evolution. May 2002, 19 (5): 654–63. PMID 11961099. doi:10.1093/oxfordjournals.molbev.a004124.
[37]
Okuda H, Tsujimura A, Irie S, Yamamoto K, Fukuhara S, Matsuoka Y, Takao T, Miyagawa Y, Nonomura N, Wada M, Tanaka H. A single nucleotide polymorphism within the novel sex-linked testis-specific retrotransposed PGAM4 gene influences human male fertility. PloS One. 2012, 7 (5): e35195. PMC 3348931 . PMID 22590500. doi:10.1371/journal.pone.0035195.
[38]
Chan WL, Chang JG. Pseudogene-derived endogenous siRNAs and their function. Methods in Molecular Biology. 2014, 1167: 227–39. PMID 24823781. doi:10.1007/978-1-4939-0835-6_15.
[39]
Chan WL, Yuo CY, Yang WK, Hung SY, Chang YS, Chiu CC, Yeh KT, Huang HD, Chang JG. Transcribed pseudogene ψPPM1K generates endogenous siRNA to suppress oncogenic cell growth in hepatocellular carcinoma. Nucleic Acids Research. April 2013, 41 (6): 3734–47. PMC 3616710 . PMID 23376929. doi:10.1093/nar/gkt047.
[40]
Roberts TC, Morris KV. Not so pseudo anymore: pseudogenes as therapeutic targets. Pharmacogenomics. December 2013, 14 (16): 2023–34. PMC 4068744 . PMID 24279857. doi:10.2217/pgs.13.172.
[41]
Olovnikov I, Le Thomas A, Aravin AA. A framework for piRNA cluster manipulation. Methods in Molecular Biology. 2014, 1093: 47–58. PMID 24178556. doi:10.1007/978-1-62703-694-8_5.
[42]
Siomi MC, Sato K, Pezic D, Aravin AA. PIWI-interacting small RNAs: the vanguard of genome defence. Nature Reviews Molecular Cell Biology. April 2011, 12 (4): 246–58. PMID 21427766. doi:10.1038/nrm3089.
[43]
Karreth FA, Reschke M, Ruocco A, Ng C, Chapuy B, Léopold V, Sjoberg M, Keane TM, Verma A, Ala U, Tay Y, Wu D, Seitzer N, Velasco-Herrera Mdel C, Bothmer A, Fung J, Langellotto F, Rodig SJ, Elemento O, Shipp MA, Adams DJ, Chiarle R, Pandolfi PP. The BRAF pseudogene functions as a competitive endogenous RNA and induces lymphoma in vivo. Cell. April 2015, 161 (2): 319–32. PMID 25843629. doi:10.1016/j.cell.2015.02.043.
[44]
Dahia PL, FitzGerald MG, Zhang X, Marsh DJ, Zheng Z, Pietsch T, von Deimling A, Haluska FG, Haber DA, Eng C. A highly conserved processed PTEN pseudogene is located on chromosome band 9p21. Oncogene. May 1998, 16 (18): 2403–6. PMID 9620558. doi:10.1038/sj.onc.1201762.
[45]
Poliseno L, Salmena L, Zhang J, Carver B, Haveman WJ, Pandolfi PP. A coding-independent function of gene and pseudogene mRNAs regulates tumour biology. Nature. June 2010, 465 (7301): 1033–8. PMC 3206313 . PMID 20577206. doi:10.1038/nature09144.
[46]
Balakirev ES, Ayala FJ. Pseudogenes: are they "junk" or functional DNA?. Annual Review of Genetics. 2003, 37: 123–51. PMID 14616058. doi:10.1146/annurev.genet.37.040103.103949.
[47]
Goodhead I, Darby AC. Taking the pseudo out of pseudogenes. Current Opinion in Microbiology. February 2015, 23: 102–9. PMID 25461580. doi:10.1016/j.mib.2014.11.012.
[48]
Williams DL, Slayden RA, Amin A, Martinez AN, Pittman TL, Mira A, Mitra A, Nagaraja V, Morrison NE, Moraes M, Gillis TP. Implications of high level pseudogene transcription in Mycobacterium leprae. BMC Genomics. August 2009, 10: 397. PMC 2753549 . PMID 19706172. doi:10.1186/1471-2164-10-397.

Gerstein M, Zheng D. The real life of pseudogenes. Scientific American. August 2006, 295 (2): 48–55. Bibcode:2006SciAm.295b..48G. PMID 16866288. doi:10.1038/scientificamerican0806-48.
Torrents D, Suyama M, Zdobnov E, Bork P. A genome-wide survey of human pseudogenes. Genome Research. December 2003, 13 (12): 2559–67. PMC 403797 . PMID 14656963. doi:10.1101/gr.1455503.
Bischof JM, Chiang AP, Scheetz TE, Stone EM, Casavant TL, Sheffield VC, Braun TA. Genome-wide identification of pseudogenes capable of disease-causing gene conversion. Human Mutation. June 2006, 27 (6): 545–52. PMID 16671097. doi:10.1002/humu.20335.

Pseudogene interaction database, miRNA-pseudogene and protein-pseudogene interaction maps database
Yale University pseudogene database
Hoppsigen database (homologous processed pseudogenes)

[Max_1986-1] [1]
Max EE. Plagiarized Errors and Molecular Genetics. Creation Evolution Journal. 1986, 6 (3): 34–46 [2017-10-19]. （原始内容存档于2019-03-01）.

[Chandrasekaran_2008-2] [2]
Chandrasekaran C, Betrán E. Origins of new genes and pseudogenes.. Nature Education. 2008, 1 (1): 181 [2017-10-19]. （原始内容存档于2020-11-22）.

[Biomed10.1186-3] [3]
Vanin EF. Processed pseudogenes: characteristics and evolution. Annual Review of Genetics. 1985, 19: 253–72. PMID 3909943. doi:10.1146/annurev.ge.19.120185.001345.

[0-4] 
Chang Y, Stuart A, et al. Antigen presenting genes and genomic copy number variations in the Tasmanian devil MHC. BMC Genomics. 2012, 13:87. doi:10.1186/1471-2164-13-87.

[5] [5]
Herron JC, Freeman S. Evolutionary analysis 4th. Upper Saddle River, NJ: Pearson Prentice Hall. 2007. ISBN 978-0-13-227584-2.

[pmid561661-6] [6]
Jacq C, Miller JR, Brownlee GG. A pseudogene structure in 5S DNA of Xenopus laevis. Cell. September 1977, 12 (1): 109–20. PMID 561661. doi:10.1016/0092-8674(77)90189-1.

[pmid25391452-7] [7]
Xiao-Jie L, Ai-Mei G, Li-Juan J, Jiang X. Pseudogene in cancer: real functions and promising signature. Journal of Medical Genetics. January 2015, 52 (1): 17–24. PMID 25391452. doi:10.1136/jmedgenet-2014-102785.

[Van_Baren_Brent_2006-8] [8]
van Baren MJ, Brent MR. Iterative gene prediction and pseudogene removal improves genome annotation. Genome Research. May 2006, 16 (5): 678–85. PMC 1457044 . PMID 16651666. doi:10.1101/gr.4766206.

[9] [9]
Kim, MS; et al. A draft map of the human proteome.. Nature. 2014, 509: 575–581. PMC 4403737 . PMID 24870542. doi:10.1038/nature13302.

[Jurka_2004-10] [10]
Jurka J. Evolutionary impact of human Alu repetitive elements. Current Opinion in Genetics & Development. December 2004, 14 (6): 603–8. PMID 15531153. doi:10.1016/j.gde.2004.08.008.

[Dewannieux_2005-11] [11]
Dewannieux M, Heidmann T. LINEs, SINEs and processed pseudogenes: parasitic strategies for genome modeling. Cytogenetic and Genome Research. 2005, 110 (1–4): 35–48. PMID 16093656. doi:10.1159/000084936.

[Dewannieux_2003-12] [12]
Dewannieux M, Esnault C, Heidmann T. LINE-mediated retrotransposition of marked Alu sequences. Nature Genetics. September 2003, 35 (1): 41–8. PMID 12897783. doi:10.1038/ng1223.

[Graur_1989-13] [13]
Graur D, Shuali Y, Li WH. Deletions in processed pseudogenes accumulate faster in rodents than in humans. Journal of Molecular Evolution. April 1989, 28 (4): 279–85. PMID 2499684. doi:10.1007/BF02103423.

[Baertsch_2008-14] [14]
Baertsch R, Diekhans M, Kent WJ, Haussler D, Brosius J. Retrocopy contributions to the evolution of the human genome. BMC Genomics. October 2008, 9: 466. PMC 2584115 . PMID 18842134. doi:10.1186/1471-2164-9-466.

[Pavlicek_2002-15] [15]
Pavlícek A, Paces J, Zíka R, Hejnar J. Length distribution of long interspersed nucleotide elements (LINEs) and processed pseudogenes of human endogenous retroviruses: implications for retrotransposition and pseudogene detection. Gene. October 2002, 300 (1–2): 189–94. PMID 12468100. doi:10.1016/S0378-1119(02)01047-8.

[16] [16]
Navarro FC, Galante PA. A Genome-Wide Landscape of Retrocopies in Primate Genomes. Genome Biology and Evolution. July 2015, 7 (8): 2265–75. PMC 4558860 . PMID 26224704. doi:10.1093/gbe/evv142.

[17] [17]
Schrider DR, Navarro FC, Galante PA, Parmigiani RB, Camargo AA, Hahn MW, de Souza SJ. Gene copy-number polymorphism caused by retrotransposition in humans. PLoS Genetics. 2013-01-24, 9 (1): e1003242. PMC 3554589 . PMID 23359205. doi:10.1371/journal.pgen.1003242.

[Max-18] [18]
Max EE. Plagiarized Errors and Molecular Genetics. TalkOrigins Archive. 2003-05-05 [2008-07-22]. （原始内容存档于2020-11-12）.

[Lynch_Conery_2000-19] [19]
Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. November 2000, 290 (5494): 1151–5. Bibcode:2000Sci...290.1151L. PMID 11073452. doi:10.1126/science.290.5494.1151.

[pmid7705642-20] [20]
Walsh JB. How often do duplicated genes evolve new functions?. Genetics. January 1995, 139 (1): 421–8. PMC 1206338 . PMID 7705642.

[pmid11779815-21] [21]
Lynch M, O'Hely M, Walsh B, Force A. The probability of preservation of a newly arisen gene duplicate. Genetics. December 2001, 159 (4): 1789–804. PMC 1461922 . PMID 11779815.

[Harrison_2002-22] [22]
Harrison PM, Hegyi H, Balasubramanian S, Luscombe NM, Bertone P, Echols N, Johnson T, Gerstein M. Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22. Genome Research. February 2002, 12 (2): 272–80. PMC 155275 . PMID 11827946. doi:10.1101/gr.207102.

[Zhang_2003-23] [23]
Zhang J. Evolution by gene duplication: an update.. Trends in Ecology and Evolution. 2003, 18 (6): 292–298. doi:10.1016/S0169-5347(03)00033-8.

[Nishikimi_1992-24] [24]
Nishikimi M, Kawai T, Yagi K. Guinea pigs possess a highly mutated gene for L-gulono-gamma-lactone oxidase, the key enzyme for L-ascorbic acid biosynthesis missing in this species. The Journal of Biological Chemistry. October 1992, 267 (30): 21967–72. PMID 1400507.

[Nishikimi_1994-25] [25]
Nishikimi M, Fukuyama R, Minoshima S, Shimizu N, Yagi K. Cloning and chromosomal mapping of the human nonfunctional gene for L-gulono-gamma-lactone oxidase, the enzyme for L-ascorbic acid biosynthesis missing in man. The Journal of Biological Chemistry. May 1994, 269 (18): 13685–8. PMID 8175804.

[Xue_2006-26] [26]
Xue Y, Daly A, Yngvadottir B, Liu M, Coop G, Kim Y, Sabeti P, Chen Y, Stalker J, Huckle E, Burton J, Leonard S, Rogers J, Tyler-Smith C. Spread of an inactive form of caspase-12 in humans is due to recent positive selection. American Journal of Human Genetics. April 2006, 78 (4): 659–70. PMC 1424700 . PMID 16532395. doi:10.1086/503116.

[Zheng-27] [27]
Zheng D, Frankish A, Baertsch R, Kapranov P, Reymond A, Choo SW, Lu Y, Denoeud F, Antonarakis SE, Snyder M, Ruan Y, Wei CL, Gingeras TR, Guigó R, Harrow J, Gerstein MB. Pseudogenes in the ENCODE regions: consensus annotation, analysis of transcription, and evolution. Genome Research. June 2007, 17 (6): 839–51. PMC 1891343 . PMID 17568002. doi:10.1101/gr.5586307.

[Prieto-Godino_2016-28] [28]
Prieto-Godino LL, Rytz R, Bargeton B, Abuin L, Arguello JR, Peraro MD, Benton R. Olfactory receptor pseudo-pseudogenes. Nature. November 2016, 539 (7627): 93–97. PMC 5164928 . PMID 27776356. doi:10.1038/nature19824.

[Pei_2012-29] [29]
Pei B, Sisu C, Frankish A, Howald C, Habegger L, Mu XJ, Harte R, Balasubramanian S, Tanzer A, Diekhans M, Reymond A, Hubbard TJ, Harrow J, Gerstein MB. The GENCODE pseudogene resource. Genome Biology. September 2012, 13 (9): R51. PMC 3491395 . PMID 22951037. doi:10.1186/gb-2012-13-9-r51.

[MacArthur_2012-30] [30]
MacArthur DG, Balasubramanian S, Frankish A, Huang N, Morris J, Walter K, et al. A systematic survey of loss-of-function variants in human protein-coding genes. Science. February 2012, 335 (6070): 823–8. PMC 3299548 . PMID 22344438. doi:10.1126/science.1215040.

[31]
Wright JC, Mudge J, Weisser H, Barzine MP, Gonzalez JM, Brazma A, Choudhary JS, Harrow J. Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow. Nature Communications. June 2016, 7: 11778. PMC 4895710. PMID 27250503. doi:10.1038/ncomms11778.

[32]
Long M, Langley CH. Natural selection and the origin of jingwei, a chimeric processed functional gene in Drosophila. Science. April 1993, 260 (5104): 91–5. Bibcode:1993Sci...260...91L. PMID 7682012. doi:10.1126/science.7682012.

[33]
Jeffs P, Ashburner M. Processed pseudogenes in Drosophila. Proceedings. Biological Sciences. May 1991, 244 (1310): 151–9. PMID 1679549. doi:10.1098/rspb.1991.0064.

[34]
Wang W, Zhang J, Alvarez C, Llopart A, Long M. The origin of the Jingwei gene and the complex modular structure of its parental gene, yellow emperor, in Drosophila melanogaster. Molecular Biology and Evolution. September 2000, 17 (9): 1294–301. PMID 10958846. doi:10.1093/oxfordjournals.molbev.a026413.

[35]
Dierick HA, Mercer JF, Glover TW. A phosphoglycerate mutase brain isoform (PGAM 1) pseudogene is localized within the human Menkes disease gene (ATP7 A). Gene. October 1997, 198 (1–2): 37–41. PMID 9370262. doi:10.1016/s0378-1119(97)00289-8.

[36]
Betrán E, Wang W, Jin L, Long M. Evolution of the phosphoglycerate mutase processed gene in human and chimpanzee revealing the origin of a new primate gene. Molecular Biology and Evolution. May 2002, 19 (5): 654–63. PMID 11961099. doi:10.1093/oxfordjournals.molbev.a004124.

[37]
Okuda H, Tsujimura A, Irie S, Yamamoto K, Fukuhara S, Matsuoka Y, Takao T, Miyagawa Y, Nonomura N, Wada M, Tanaka H. A single nucleotide polymorphism within the novel sex-linked testis-specific retrotransposed PGAM4 gene influences human male fertility. PLoS One. 2012, 7 (5): e35195. PMC 3348931. PMID 22590500. doi:10.1371/journal.pone.0035195.

[38]
Chan WL, Chang JG. Pseudogene-derived endogenous siRNAs and their function. Methods in Molecular Biology. 2014, 1167: 227–39. PMID 24823781. doi:10.1007/978-1-4939-0835-6_15.

[39]
Chan WL, Yuo CY, Yang WK, Hung SY, Chang YS, Chiu CC, Yeh KT, Huang HD, Chang JG. Transcribed pseudogene ψPPM1K generates endogenous siRNA to suppress oncogenic cell growth in hepatocellular carcinoma. Nucleic Acids Research. April 2013, 41 (6): 3734–47. PMC 3616710. PMID 23376929. doi:10.1093/nar/gkt047.

[40]
Roberts TC, Morris KV. Not so pseudo anymore: pseudogenes as therapeutic targets. Pharmacogenomics. December 2013, 14 (16): 2023–34. PMC 4068744. PMID 24279857. doi:10.2217/pgs.13.172.

[41]
Olovnikov I, Le Thomas A, Aravin AA. A framework for piRNA cluster manipulation. Methods in Molecular Biology. 2014, 1093: 47–58. PMID 24178556. doi:10.1007/978-1-62703-694-8_5.

[42]
Siomi MC, Sato K, Pezic D, Aravin AA. PIWI-interacting small RNAs: the vanguard of genome defence. Nature Reviews Molecular Cell Biology. April 2011, 12 (4): 246–58. PMID 21427766. doi:10.1038/nrm3089.

[43]
Karreth FA, Reschke M, Ruocco A, Ng C, Chapuy B, Léopold V, Sjoberg M, Keane TM, Verma A, Ala U, Tay Y, Wu D, Seitzer N, Velasco-Herrera Mdel C, Bothmer A, Fung J, Langellotto F, Rodig SJ, Elemento O, Shipp MA, Adams DJ, Chiarle R, Pandolfi PP. The BRAF pseudogene functions as a competitive endogenous RNA and induces lymphoma in vivo. Cell. April 2015, 161 (2): 319–32. PMID 25843629. doi:10.1016/j.cell.2015.02.043.

[44]
Dahia PL, FitzGerald MG, Zhang X, Marsh DJ, Zheng Z, Pietsch T, von Deimling A, Haluska FG, Haber DA, Eng C. A highly conserved processed PTEN pseudogene is located on chromosome band 9p21. Oncogene. May 1998, 16 (18): 2403–6. PMID 9620558. doi:10.1038/sj.onc.1201762.

[45]
Poliseno L, Salmena L, Zhang J, Carver B, Haveman WJ, Pandolfi PP. A coding-independent function of gene and pseudogene mRNAs regulates tumour biology. Nature. June 2010, 465 (7301): 1033–8. PMC 3206313. PMID 20577206. doi:10.1038/nature09144.

[46]
Balakirev ES, Ayala FJ. Pseudogenes: are they "junk" or functional DNA? Annual Review of Genetics. 2003, 37: 123–51. PMID 14616058. doi:10.1146/annurev.genet.37.040103.103949.

[47]
Goodhead I, Darby AC. Taking the pseudo out of pseudogenes. Current Opinion in Microbiology. February 2015, 23: 102–9. PMID 25461580. doi:10.1016/j.mib.2014.11.012.

[48]
Williams DL, Slayden RA, Amin A, Martinez AN, Pittman TL, Mira A, Mitra A, Nagaraja V, Morrison NE, Moraes M, Gillis TP. Implications of high level pseudogene transcription in Mycobacterium leprae. BMC Genomics. August 2009, 10: 397. PMC 2753549. PMID 19706172. doi:10.1186/1471-2164-10-397.
