In biology and medicine, not only has a lot of data already been generated, but technological advances are enabling us to gather more of it faster every year. Having a lot of data, however, does not necessarily mean we have any greater understanding of how things work.
Discovering how diseases begin and how they might be treated comes from putting the data in a context that it can be understood and then putting the data in the right hands to use it. One example of this is human genetics. Humans have about 25,000 genes, and we have data on the location of each gene, but we still don’t know what about one-third of them do.
In my lab, we focus on using computers to make sense of the “data deluge” by finding patterns within large databases and using different sources of information to identify and experimentally verify causal relationships buried within the data. Using this method, we’ve been able to predict the function of thousands of the remaining human genes that still have no known function. With collaborators, we have been able to test the predicted functions of dozens of these genes in the lab and discovered that several of these formerly unknown genes play important roles in immune cell movement, coagulation, breast cancer progression, DNA repair and cell division.
We hope to continue to push the boundaries of what we know about human genetics as far as possible with this new algorithm until, hopefully, this “Final Third” of our genome will no longer be a mystery.
My area of research is in Bioinformatics which, briefly defined, is the application of computational methods to solve biomedical problems. I focus on developing methods to enable computers to play a greater role in automatedknowledge discovery. In other words, in addition to using computers to solve specific problems, I am also interested in ways of getting computers to first establish what is known and then be able to condense large amounts of diverse data to infer what is not yet known, but statistically significant and scientifically interesting. As one might suspect, defining what is scientifically interesting turns out to be harder than defining statistical significance, but that’s what makes it fun.
In general, I am interested in both integrating and data-mining large biomedical databases for patterns that can help science accelerate its knowledge regarding the genetic causes that lead to the onset and progression of diseases. Although we’ve known for almost a decade now the physical location of the 25,000 genes we humans have, approximately one-third of them still have no known function. For genes we do know something about, the amount of information per gene is extremely skewed towards those of commercial importance and, for reasons unknown, the rate of new gene discovery has slowed noticably over the past 5 years. Emerging data indicates many, if not most, of these uncharacterized genes are just as important, biologically speaking, as the ones we do know about. These uncharacterized genes are consistently appearing in genome-wide association searches for mutations that cause human disease. Thus, there’s a growing need to accurately predict gene function.
My current research focus is on the refinement and testing of an algorithm I’ve developed to infer gene function by integrating and modeling the information contained both in the massive amount of scientific literature (over 19 million records in MEDLINE, growing at a rate of around 750,000 new scientific papers per year) and in experimental databases such as gene expression and protein-protein interaction databases. With collaborators, mostly local, we are experimentally testing the predicted gene functions and have found that it has performed very accurately so far. We have now discovered approximately 37 new genes involved in important biological processes such as coagulation, immune cell movement, cell division, brain cancer growth, endometriosis and Alzheimer’s Disease, among others. The discovery of these new genes is important because, for many of them, it opens up the possibility that we can create more accurate diagnostics for diseases, prognose disease outcome, and identify new targets for pharmaceutical intervention.
B.B.A., University of Oklahoma, 1991
B.S., University of Oklahoma, 1996
Ph.D., University of Texas Southwestern Medical Center, 2003
Honors and Awards
1989 Data Processing Management Association Scholarship
1989 and 1990 Conoco Scholarship
1999 NIH Institutional Training Grant Award in Genomic Science
2003-present Scientific Advisory Board, eTexx Biopharmaceuticals, Inc.
2003-present Board of Directors, MCBIOS
2004-2008 President, Oklahoma Bioinformatics Society (OKBIOS)
2007-2008 President, MidSouth Bioinformatics Society (MCBIOS)
2006-2007 Who’s Who in Science and Engineering
2006-2007 Who’s Who in America
2007 Who’s Who of Emerging Leaders
Ad hoc reviewer for numerous scientific journals; organizer and judge for annual OKBIOS symposia; senior editor for 2006 and 2007 MCBIOS conference proceedings; scientific review panel for Susan G. Komen Breast Cancer Foundation; selection panel for 2006 Summer Undergraduate Research Program awards (Oklahoma State Regents for Higher Education); grant review panel for Genome Canada 2005 competition III.
1998-present International Society for Computational Biology
2003-present Mid-South Computational Biology and Bioinformatics Society
2004-present Oklahoma Bioinformatics Society
Joined OMRF Scientific Staff in 2007.
Van Remmen H, Freeman WM, Miller BF, Kinter M, Wren JD, Chiao A, Towner RA, Snider TA, Sonntag WE, Richardson A. Oklahoma Nathan Shock Aging Center - assessing the basic biology of aging from genetics to protein and function. Geroscience, 2021 October, PMID: 34606039
Edwards SL, Erdenebat P, Morphis AC, Kumar L, Wang L, Chamera T, Georgescu C, Wren JD, Li J. Insulin/IGF-1 signaling and heat stress differentially regulate HSF1 activities in germline development. Cell Rep 36:109623, 2021 August, PMID: 34469721, PMCID: PMC8442575
Subramaniyan B, Larabee JL, Bodas M, Moore AR, Burgett AWG, Myers DA, Georgescu C, Wren JD, Papin JF, Walters MS. Characterization of the SARS-CoV-2 Host Response in Primary Human Airway Epithelial Cells from Aged Individuals. Viruses 13, 2021 August, PMID: 34452468, PMCID: PMC8402710
Buckley DA, Jennings EM, Burke NN, Roche M, McInerney V, Wren JD, Finn DP, McHugh PC. Erratum to: The Development of Translational Biomarkers as a Tool for Improving the Understanding, Diagnosis and Treatment of Chronic Neuropathic Pain. Mol Neurobiol. 2017 Jul 1. PMID: 28669124 PMCID: PMC4960984
Siefert JC, Georgescu C, Wren JD, Koren A, Sansam CL. DNA replication timing during development anticipates transcriptional programs and parallels enhancer activation. Genome Res. 2017 May 16. pii: gr.218602.116. PMID: 28512193 PMCID: PMC5538556
Unnikrishnan A, Jackson J, Matyi SA, Hadad N, Wronowski B, Georgescu C, Garrett KP, Wren JD, Freeman WM, Richardson A. Role of DNA methylation in the dietary restriction mediated cellular memory. Geroscience. 2017 May 5. PMID: 28477138 PMCID: PMC5505897
Ziegler J, Pody R, Coutinho de Souza P, Evans B, Saunders D, Smith N, Mallory S, Njoku C, Dong Y, Chen H, Dong J, Lerner M, Mian O, Tummala S, Battiste J, Fung KM, Wren JD, Towner RA. ELTD1, an effective anti-angiogenic target for gliomas: preclinical assessment in mouse GL261 and human G55 xenograft glioma models. Neuro Oncol. 2017 Feb 1;19(2):175-185. PMID: 27416955 PMCID: PMC5464087
Kushwaha G, Dozmorov M, Wren JD, Qiu J, Shi H, Xu D. Hypomethylation coordinates antagonistically with hypermethylation in cancer development: a case study of leukemia. Hum Genomics. 2016 Jul 25;10 Suppl 2:18. PMID: 27461342 PMCID: PMC4965721
Hadad N, Masser DR, Logan S, Wronowski B, Mangold CA, Clark N, Otalora L, Unnikrishnan A, Ford MM, Giles CB, Wren JD, Richardson A, Sonntag WE, Stanford DR, Freeman W. Absence of genomic hypomethylation or regulation of cytosine-modifying enzymes with aging in male and female mice. Epigenetics Chromatin. 2016 Jul 13;9:30. eCollection 2016. PMID: 27413395 PMCID: PMC4942942
Constantin Georgescu, Ph.D.
Associate Staff Scientist
Cory Giles, Ph.D.
Computational Postdoctoral Fellow
Senior Data Analyst
News from the Wren lab
The National Institutes of Health has awarded two grants worth a total of $26.3 million to OMRF for research into anthrax and to help train new scientists. Each grant will allow scientists to continue research started in 2004 and 2005 and keep them working through 2014 on several interconnected projects. In the first project, a $14.5 […]
Much like the world of college essays, scientific journals are often plagued with authors trying to publish someone else’s work as their own. For Oklahoma Medical Research Foundation scientist Jonathan Wren, Ph.D., the issue hit home as part of his duties as an editor for the journal Bioinformatics, when a reviewer recognized a paper as […]
Rare is the scientific paper today written by a single author. With research being conducted by teams of scientists, most studies now boast a half-dozen or so authors. According to a new study led by a scientist at the Oklahoma Medical Research Foundation, credit for those papers is far from evenly distributed, and the order […]
The Oklahoma Medical Research Foundation today announced the addition of four scientists to the faculty of its Arthritis & Immunology Research Program. The four new faculty members are Patrick Gaffney, M.D., Kathy Moser, Ph.D., Jonathan Wren, Ph.D., and Igor Dozmorov, Ph.D. “Their recruitment gives us a depth of scientific commitment and expertise that is unparalleled […]