Researchers at Penn and colleagues have developed a tool to analyze single cells that assesses both the patterns of gene activation within a cell and which sibling cells shared a common progenitor.
Recent advances in analyzing data at the single-cell level have helped biologists uncover new information about cells and their behaviors. One commonly used approach, known as clustering, allows scientists to group cells based on characteristics such as the unique patterns of active or inactive genes or by the progeny of duplicating cells, known as clones, over several generations.
Although single-cell clustering has led to many significant findings, for example, new cancer cell subsets or the way immature stem cells mature into “specialized” cells, researchers have not been able to marry what they knew about gene activation patterns with what they knew about clone lineages.
Now, research published in Cell Genomics led by University of Pennsylvania professor of bioengineering Arjun Raj has resulted in the development of ClonoCluster. This open-source tool combines unique patterns of gene activation with clonal information. This produces hybrid cluster data that can quickly identify new cellular traits; that can then be used to understand resistance to some cancer therapies better.
“Before, these were independent modalities, where you would cluster the cells that express the same genes in one lot and cluster the others that share a common ancestor in another,” says Lee Richman, first paper author and a former postdoc in the Raj lab who is now at Brigham and Women’s Hospital in Boston.
“What’s exciting is that this tool allows you to draw new lines around your clusters and explore their properties, which could help us identify new cell types, functions, and molecular pathways.”
Researchers in the Raj Lab use a technique known as barcoding to assign labels to cells they are interested in studying; particularly useful for tracking cells, clustering data based on cells’ offspring, and following lineages over time.
Believing they could parse more valuable information out of this data by incorporating the cell’s unique patterns of gene activation, the researchers applied ClonoCluster to six experimental datasets that used barcoding to track dividing cells’ offspring. Specifically, they looked at the development of chemotherapy resistance and of stem cells into specialized tissue types.
Much like how stereo systems allow a user to control the balance of sound between the left speaker relative to the right, the researchers added a tunable parameter called “alpha” to ClonoCluster.
This allowed the team to fine-tune how much emphasis they placed on clustering by clonal barcode relative to patterns of gene activation—with 0 corresponding to gene activation and 1 to clonal barcodes.
On this 0 to 1 alpha scale, they found that the closer alpha was set to the clonal channel, the more their hybrid clusters expressed the unique gene levels associated with extracellular environment interactions and the translation of proteins from messenger RNA.
“Normally, when you consider how cell function is decided and regulated, you think it’s likely intrinsic mechanisms that decide which genes will be expressed, or the cell’s fate,” says Richman. “So, we believe that this hints at the fact that cells can also inherit traits through non-genetic means, possibly through environmental cues.”
The prevailing notion behind gene regulation suggests that proteins, known as transcription factors, bind to specific points on DNA sequences and activate or suppress certain genes, helping explain why most cells in an animal have the same genetic information but serve varying roles.
As these cells bind to different sites, transcription factors signal to the protein-manufacturing apparatus, which proteins the cell should produce to stay functionally relevant. Scientists can then use these biomarkers to monitor cellular properties and predict what the cell might do next.
Enhancing scientific understanding of how cells behave could improve what’s known about cellular cloning. It could also offer insight into what goes awry when duplication runs rampant, such as in the case for cancers and certain conditions associated with genetic mutations characterized by the overrepresentation of clone cells in a given cellular population.
The researchers developed a complementary tool to ClonoCluster called Warp Factor to take this work a step further. This incorporates an existing machine-learning tool that simplifies dataset visualization to explain better the relationship between clonal data and their hybrid clusters.
Using both tools, the team identified distinct expression patterns that previously seemed too dispersed to draw inferences on cell identity, information that could help researchers understand more about how cells differentiate and function.
“We think this work shows that tracking cellular identity over time can provide a new layer of information that researchers have been unaware of and hopefully can be exploited to deepen our understanding of cancer and other diseases,” Raj says.
He and colleagues will continue to update and refine ClonoCluster. They believe this is a significant advance in the field of computational cell biology with the potential to provide a more complete picture of cellular characteristics and function in the future.
Source: University of Pennsylvania