In biology and medicine, not only has a lot of data already been generated, but technological advances are enabling us to gather more of it faster every year. Having a lot of data, however, does not necessarily mean we have any greater understanding of how things work.
Discovering how diseases begin and how they might be treated comes from putting the data in a context that it can be understood and then putting the data in the right hands to use it. One example of this is human genetics. Humans have about 25,000 genes, and we have data on the location of each gene, but we still don’t know what about one-third of them do.
In my lab, we focus on using computers to make sense of the “data deluge” by finding patterns within large databases and using different sources of information to identify and experimentally verify causal relationships buried within the data. Using this method, we’ve been able to predict the function of thousands of the remaining human genes that still have no known function. With collaborators, we have been able to test the predicted functions of dozens of these genes in the lab and discovered that several of these formerly unknown genes play important roles in immune cell movement, coagulation, breast cancer progression, DNA repair and cell division.
We hope to continue to push the boundaries of what we know about human genetics as far as possible with this new algorithm until, hopefully, this “Final Third” of our genome will no longer be a mystery.
B.B.A., University of Oklahoma, 1991
B.S., University of Oklahoma, 1996
Ph.D., University of Texas Southwestern Medical Center, 2003
Honors and Awards
1989 Data Processing Management Association Scholarship
1989 and 1990 Conoco Scholarship
1999 NIH Institutional Training Grant Award in Genomic Science
2003-present Scientific Advisory Board, eTexx Biopharmaceuticals, Inc.
2003-present Board of Directors, MCBIOS
2004-2008 President, Oklahoma Bioinformatics Society (OKBIOS)
2007-2008 President, MidSouth Bioinformatics Society (MCBIOS)
2006-2007 Who’s Who in Science and Engineering
2006-2007 Who’s Who in America
2007 Who’s Who of Emerging Leaders
Ad hoc reviewer for numerous scientific journals; organizer and judge for annual OKBIOS symposia; senior editor for 2006 and 2007 MCBIOS conference proceedings; scientific review panel for Susan G. Komen Breast Cancer Foundation; selection panel for 2006 Summer Undergraduate Research Program awards (Oklahoma State Regents for Higher Education); grant review panel for Genome Canada 2005 competition III.
1998-present International Society for Computational Biology
2003-present Mid-South Computational Biology and Bioinformatics Society
2004-present Oklahoma Bioinformatics Society
Joined OMRF Scientific Staff in 2007.
My area of research is in Bioinformatics which, briefly defined, is the application of computational methods to solve biomedical problems. I focus on developing methods to enable computers to play a greater role in automatedknowledge discovery. In other words, in addition to using computers to solve specific problems, I am also interested in ways of getting computers to first establish what is known and then be able to condense large amounts of diverse data to infer what is not yet known, but statistically significant and scientifically interesting. As one might suspect, defining what is scientifically interesting turns out to be harder than defining statistical significance, but that’s what makes it fun.
In general, I am interested in both integrating and data-mining large biomedical databases for patterns that can help science accelerate its knowledge regarding the genetic causes that lead to the onset and progression of diseases. Although we’ve known for almost a decade now the physical location of the 25,000 genes we humans have, approximately one-third of them still have no known function. For genes we do know something about, the amount of information per gene is extremely skewed towards those of commercial importance and, for reasons unknown, the rate of new gene discovery has slowed noticably over the past 5 years. Emerging data indicates many, if not most, of these uncharacterized genes are just as important, biologically speaking, as the ones we do know about. These uncharacterized genes are consistently appearing in genome-wide association searches for mutations that cause human disease. Thus, there’s a growing need to accurately predict gene function.
My current research focus is on the refinement and testing of an algorithm I’ve developed to infer gene function by integrating and modeling the information contained both in the massive amount of scientific literature (over 19 million records in MEDLINE, growing at a rate of around 750,000 new scientific papers per year) and in experimental databases such as gene expression and protein-protein interaction databases. With collaborators, mostly local, we are experimentally testing the predicted gene functions and have found that it has performed very accurately so far. We have now discovered approximately 37 new genes involved in important biological processes such as coagulation, immune cell movement, cell division, brain cancer growth, endometriosis and Alzheimer’s Disease, among others. The discovery of these new genes is important because, for many of them, it opens up the possibility that we can create more accurate diagnostics for diseases, prognose disease outcome, and identify new targets for pharmaceutical intervention.
Dozmorov MG, Giles CB, Koelsch KA, Wren JD. Systematic classification of non-coding RNAs by epigenomic similarity. BMC Bioinformatics 14 Suppl 14:S2, 2013. [Abstract]
Giles CB, Girija-Devi R, Dozmorov MG, Wren JD. mirCoX: a database of miRNA-mRNA expression correlations derived from RNA-seq meta-analysis. BMC Bioinformatics 14 Suppl 14:S17, 2013. [Abstract]
Dozmorov MG, Wren JD, Alarcon-Riquelme ME. Epigenomic elements enriched in the promoters of autoimmunity susceptibility genes. Epigenetics 9: 276-285, 2013. [Abstract]
Dozmorov MG, Cara LR, Giles CB, Wren JD. GenomeRunner: Automating genome exploration. Bioinformatics 28:419-420, 2012. [Abstract]
Daum JR, Wren JD, Daniel JJ, Sivakumar S, McAvoy JN, Potapova TA, Gorbsky GJ. Ska3 is required for spindle checkpoint silencing and the maintenance of chromosome cohesion in mitosis. Curr Biol 19:1467-1472, 2009. [Abstract]
Wren JD. A global meta-analysis of microarray expression data to predict unknown gene functions and estimate the literature-data divide. Bioinformatics 25:1694-1701, 2009. [Abstract]
Lupu C, Zhu H, Popescu NI, Wren JD, Lupu F. Novel protein ADTRP regulates TFPI expression and function in human endothelial cells in normal conditions and in response to androgen. Blood 118:4463-4471, 2011. [Abstract]
Giles CB, Wren JD. Large-scale directional relationship extraction and resolution. BMC Bioinformatics 9 Suppl 9:S11, 2008. [Abstract]
Wren JD, Bekeredjian R, Stewart JA, Shohet RV, Garner HR. Knowledge discovery by automated identification and ranking of implicit relationships. Bioinformatics 20:389-398, 2004. [Abstract]
Arthritis & Clinical Immunology Research Program, MS 58
Oklahoma Medical Research Foundation
825 N.E. 13th Street
Oklahoma City, OK 73104
Phone: (405) 271-6989
Fax: (405) 271-4110