New statistical tool will improve understanding of cancer genetics

July 12, 2013

An article published in the July 8 Proceedings of the National Academy of Sciences (PNAS) describes the development of a new data-mining tool that will improve researchers’ understanding of cancer genetics.

The paper, “Biclustering with heterogeneous variance,” is co-authored by Guanhua Chen, biostatistics doctoral student in the Gillings School of Global Public Health; Patrick Sullivan, MD, Ray M. Hayworth and Family Distinguished Professor of psychiatry, professor of genetics, and adjunct professor of epidemiology in The University of North Carolina at Chapel Hill School of Medicine and Gillings School of Global Public Health; and Michael Kosorok, PhD, W.R. Kenan Jr. Distinguished Professor and chair of biostatistics and professor of statistics and operations research.

The diagnosis and treatment of disease are improved by categorizing patients into subtypes based on a disease’s etiology and the types of therapy to which it responds. This is particularly true of cancer, which is in reality a collection of several diseases.

One way to group patients is by clustering, which identifies subgroups of individuals with similar genetic profiles. Clustering groups objects so that those in one cluster are more similar to each other than to objects in other clusters.

Current clustering methods have limitations, however, because they do not account for the fact that people or disease characteristics may display differing magnitudes of volatility in the way genes decode, or express, genetic information. This “heterogeneity of variance,” if not accounted for, can lead to inaccurate cluster sets and incorrect research conclusions.
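To make the idea of heterogeneous variance concrete, the short Python sketch below simulates one measurement for two hypothetical patient groups that share the same average level but differ sharply in volatility. It is an illustration only, not the method described in the paper: the group names, sample size, and distribution parameters are all assumptions chosen for the example.

```python
import random
import statistics

def simulate_groups(n=200, seed=0):
    """Simulate one measurement for two hypothetical patient groups.

    Both groups share the same mean (0.5) but differ in spread.
    All parameters are illustrative assumptions, not values from the study.
    """
    rng = random.Random(seed)
    stable = [rng.gauss(0.5, 0.02) for _ in range(n)]    # low-variance group
    volatile = [rng.gauss(0.5, 0.20) for _ in range(n)]  # high-variance group
    return stable, volatile

def summarize(values):
    """Return the sample mean and sample variance of a list of values."""
    return statistics.mean(values), statistics.variance(values)

stable, volatile = simulate_groups()
mean_s, var_s = summarize(stable)
mean_v, var_v = summarize(volatile)

# A comparison based only on means sees almost no difference between groups,
# while the variances differ by a large factor.
mean_gap = abs(mean_s - mean_v)
variance_ratio = var_v / var_s

print(f"difference in means: {mean_gap:.3f}")
print(f"ratio of variances:  {variance_ratio:.1f}")
```

In this toy setting, a grouping rule that looks only at average levels would have little basis for separating the two groups, whereas one that also models the spread could tell them apart.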

Chen and colleagues developed and implemented a statistical framework that captures both mean and variance structures in genetic data. The resulting data-mining tool, which the researchers applied both to synthetic (simulated) data and to two cancer data sets, identifies for the first time certain genes and cancer types that exhibit hypervariability of DNA methylation levels, and it detects clearer subgroup patterns in lung cancer. DNA methylation is a process in which methyl groups are added to certain DNA nucleotides in order to maintain healthy cell life.

“Not only is this work important scientifically,” Kosorok said, “but it is also significant that a paper appearing in a top-tier general science journal such as PNAS has a student as first author.”

The software used in the study is available online.

Gillings School of Global Public Health contact: David Pesci, director of communications, (919) 962-2600 or dpesci@unc.edu.
