Human capacitance to dosage imbalance: coping with inefficient selection.

Proteins rely on associations to improve packing quality and thus maintain structural integrity. This makes packing deficiency a likely determinant of dosage sensitivity, that is, of the fitness impact of concentration imbalances relative to the stoichiometry of the protein complexes. This hypothesis was validated by examining evolution-related dosage imbalances: Duplicates of genes encoding for deficiently packed proteins are less likely to be retained than genes coding for well-packed proteins. This selection pressure is apparent in unicellular organisms, but is mitigated in higher eukaryotes. In human, this effect reveals a capacitance toward dosage imbalance. This capacitance is not expected in organisms with larger population size, where evolutionary forces are more efficient at promoting adaptive functional innovation and purifying selection, thus curbing the concentration imbalance arising from gene duplication. By examining miRNA target dissimilarities within human gene families, we show that the capacitance is operative at a post-transcriptional regulatory level: The higher the packing deficiency of a protein, the more likely that its paralogs will be dissimilarly targeted by miRNA to mitigate dosage imbalance. For families with low capacitance, paralog sequence divergence and family size correlate tightly with packing deficiency, just like in unicellular eukaryotes. Thus, a major component of human tolerance toward dosage imbalances is rooted in the paralog-discriminating capacity of miRNA regulation. The results may clarify the evolutionary etiology of aggregation-related diseases, since aggregation is often promoted by overexpression (a dosage imbalance) and aggregation propensity is associated with extreme packing deficiency.

[1]  Tanya Z. Berardini,et al.  The Arabidopsis Information Resource (TAIR): gene structure and function annotation , 2007, Nucleic Acids Res..

[2]  C. Burge,et al.  Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA Targets , 2005, Cell.

[3]  M. Lynch,et al.  The Origins of Genome Complexity , 2003, Science.

[4]  R. Hilgenfeld,et al.  Utility of homology models in the drug discovery process , 2004, Drug Discovery Today.

[5]  Ariel Fernández,et al.  Dehydration propensity of order-disorder intermediate regions in soluble proteins. , 2007, Journal of proteome research.

[6]  Z. Yang,et al.  Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. , 2000, Molecular biology and evolution.

[7]  Ariel Fernández,et al.  Keeping dry and crossing membranes , 2004, Nature Biotechnology.

[8]  Jing Wang,et al.  Using FlyAtlas to identify better Drosophila models of human disease , 2008 .

[9]  R. Veitia Gene dosage balance: deletions, duplications and dominance. , 2005, Trends in genetics : TIG.

[10]  Ariel Fernández,et al.  Adherence of packing defects in soluble proteins. , 2003, Physical review letters.

[11]  Eugene V Koonin,et al.  A common framework for understanding the origin of genetic dominance and evolutionary fates of gene duplications. , 2004, Trends in genetics : TIG.

[12]  C. Pál,et al.  Dosage sensitivity and the evolution of gene families in yeast , 2003, Nature.

[13]  T. Blundell,et al.  Comparative protein modelling by satisfaction of spatial restraints. , 1993, Journal of molecular biology.

[14]  Ariel Fernández,et al.  Structural defects and the diagnosis of amyloidogenic propensity , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[15]  Ariel Fernández,et al.  Protein Under-Wrapping Causes Dosage Sensitivity and Decreases Gene Duplicability , 2007, PLoS genetics.

[16]  P. Radivojac,et al.  PROTEINS: Structure, Function, and Bioinformatics Suppl 7:176–182 (2005) Exploiting Heterogeneous Sequence Properties Improves Prediction of Protein Disorder , 2022 .

[17]  D. Bartel MicroRNAs: Target Recognition and Regulatory Functions , 2009, Cell.

[18]  Ariel Fernández,et al.  Insufficiently dehydrated hydrogen bonds as determinants of protein interactions , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[19]  L. Kruglyak,et al.  Comparative Developmental Expression Profiling of Two C. elegans Isolates , 2008, PloS one.

[20]  Stefan R. Henz,et al.  A gene expression map of Arabidopsis thaliana development , 2005, Nature Genetics.

[21]  C. Burge,et al.  Most mammalian mRNAs are conserved targets of microRNAs. , 2008, Genome research.

[22]  Wen-Hsiung Li,et al.  MicroRNA regulation of human protein protein interaction network. , 2007, RNA.

[23]  R. Veitia,et al.  Exploring the etiology of haploinsufficiency. , 2002, BioEssays : news and reviews in molecular, cellular and developmental biology.

[24]  S. Batalov,et al.  A gene atlas of the mouse and human protein-encoding transcriptomes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[25]  John Kuriyan,et al.  The origin of protein interactions and allostery in colocalization , 2007, Nature.

[26]  Andrej ⩽ali,et al.  Comparative protein modeling by satisfaction of spatial restraints , 1995 .

[27]  G. Wray,et al.  Abundant raw material for cis-regulatory evolution in humans. , 2002, Molecular biology and evolution.

[28]  Hideki Innan,et al.  Very Low Gene Duplication Rate in the Yeast Genome , 2004, Science.

[29]  L. Lim,et al.  MicroRNA targeting specificity in mammals: determinants beyond seed pairing. , 2007, Molecular cell.

[30]  Andreas Prlic,et al.  Ensembl 2006 , 2005, Nucleic Acids Res..

[31]  Scott A. Givan,et al.  ASRP: the Arabidopsis Small RNA Project Database , 2004, Nucleic Acids Res..

[32]  Marc A. Martí-Renom,et al.  Tools for comparative protein structure modeling and analysis , 2003, Nucleic Acids Res..

[33]  R. Russell,et al.  The relationship between sequence and interaction divergence in proteins. , 2003, Journal of molecular biology.

[34]  Feng-Chi Chen,et al.  Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees. , 2001, American journal of human genetics.

[35]  A Keith Dunker,et al.  Alternative splicing in concert with protein intrinsic disorder enables increased functional diversity in multicellular organisms. , 2006, Proceedings of the National Academy of Sciences of the United States of America.

[36]  A. Sali,et al.  Comparative protein structure modeling of genes and genomes. , 2000, Annual review of biophysics and biomolecular structure.

[37]  D. Nicolae,et al.  Rapid divergence in expression between duplicate genes inferred from microarray data. , 2002, Trends in genetics : TIG.

[38]  Max F. Perutz,et al.  Mechanisms of Cooperativity and Allosteric Regulation in Proteins , 1990 .