The evolution of human evolution

Last Updated
Project Description

In principle, Darwinian evolution requires at least two essential ingredients: (i) processes that change the inherited genetic material (i.e., mutation of the germline DNA); and (ii) processes that cause natural section based on the functional/phenotypic results of these genetic changes. Germline mutations are believed to predominately originate from endogenous cellular processes with minor contributions from exogenous processes. Each mutational process imprints a characteristic mutational pattern on the genome, termed, mutational signature. For example, the deamination of 5-methylcytosine to thymine is an endogenous process generating C:G>T:A mutations at CpG dinucleotides, while CC:GG>TT:AA doublet substitutions occurring at dypyrimidines are associated with exogenous exposure to ultraviolet light. Analyses of mutational signatures in thousands of cancer genomes has revealed the signatures of more that 100 mutational processes. Some of these mutational processes are operative throughout the entire lifetime of an individual whereas others are present only at certain stages of life. The signatures of processes gradually accumulating throughout the entire lifetime of an individual are referred to as clock-like mutational signatures, and these include signature 1 (etiology: deamination of 5-methylcytosine) and signature 5 (etiology: unknown). Previous work on de novo germline mutations derived from family trios demonstrated that signatures 1 and 5 can explain the majority of these germline variants, indicating that the clock-like signatures are the main contributors to human evolution. However, the activity of different mutational signatures has never been evaluated in regard to the phylogenetic timeline of human evolution. In this rotation project, we will analyze data from the 1000 genomes project, a database containing the germline genomes of 2,504 individuals from 26 populations. This database includes 84.7 million single nucleotide polymorphisms (SNPs) and 3.6 million short insertions/deletions (indels) phased onto high-quality haplotypes. Using these data, we will build a phylogenetic tree (i.e., a tree showing the evolutionary relationship between individuals) where each leaf of the tree will contain the private germline mutations derived from a single individual. The activity of mutational signatures will be evaluated in each leaf as well as in each node of the phylogenetic three. The analysis will reveal the activity of mutational signatures throughout human evolution.

Understanding the molecular landscape of precancers for preventing cancer

Last Updated
Project Description

All cancers originate from a single cell that undergoes a transformation from a normally functioning somatic cell into a malignant neoplasm. In most cases, this transformation follows a stepwise process with the somatic cell first expanding into a precancer and, subsequently, becoming an advanced invasive cancer. The progression from a pre-malignant tumor to a malignant neoplasm is due to somatic mutations that can be traced, characterized, and genomically studied. In this rotation, the student will evaluate the mutational burden, driver mutations, copy number changes, mutational signatures, and subclonal architecture of pre-malignant lesions and compare them to molecular events previously identified in advanced invasive cancers. The goal is to reveal the molecular events that are necessary for a precancer to convert into cancer. Independent previously generated drug-screen datasets (e.g., Cancer Cell Line Encyclopedia) will be used to propose potential intervention strategies that can used to target these molecular events in order to halt this conversion and lead to cancer prevention.

Michael Cuoco

First Name
Michael
Last Name
Cuoco
Student Status
Graduate Student
Email
mcuoco@ucsd.edu
Major
Bioinformatics and Systems Biology
Co-Advisor
Completed Degrees

B.S., Molecular and Cellular Biology, Trinity College-Connecticut, 2017

BISB Training Grant
No
Special Funding or Awards

2020-21 Trainee on NIH Training Grant in Bioinformatics

NSF Graduate Research Fellowship

Research Focus
Genomics approaches to understanding retrotransposon activity in the human brain

Severine Soltani

First Name
Severine
Last Name
Soltani
Student Status
Graduate Student
Email
ssoltani@ucsd.edu
Major
Bioinformatics and Systems Biology
Advisor
Completed Degrees

B.S., Cognitive Science w/ Specialization in Machine Learning and Neural Computation, University of California, San Diego, 2020
Minor, Mathematics, University of California, San Diego, 2020

BISB Training Grant
No
Special Funding or Awards

2021-23 Trainee on NIH Training Grant in Bioinformatics

Karla P. Godinez-Macias

First Name
Karla P.
Last Name
Godinez-Macias
Student Status
Alumni
Email
kpgodine@ucsd.edu
Major
Bioinformatics and Systems Biology
Co-Advisor
Completed Degrees

B.S., Computer Science, Universidad Autónoma de Baja California, 2016
M.S., Bioinformatics, The University of Texas at El Paso, 2019

BISB Training Grant
No
Degree Conferred
Ph.D.
Year Graduated
Quarter Graduated
Spring
Thesis Title
Leveraging computational approaches for the identification of therapeutic drug target candidates in Plasmodium parasites

Arya Massarat

First Name
Arya
Last Name
Massarat
Student Status
Graduate Student
Email
amassara@ucsd.edu
Major
Bioinformatics and Systems Biology
Completed Degrees

B.S., Mathematical and Computational Biology, Harvey Mudd College, 2020

BISB Training Grant
No
Special Funding or Awards

2020-21 Trainee on NIH Training Grant in Bioinformatics

NSF Graduate Research Fellowship

Research Focus
Fine-mapping, Reproducibility

Lauryn Bruce

First Name
Lauryn
Last Name
Bruce
Student Status
Alumni
Email
lbruce@ucsd.edu
Major
Bioinformatics and Systems Biology with a Specialization in Biomedical Informatics
Advisor
Co-Advisor
BISB Training Grant
No
Special Funding or Awards

2020-23 Trainee on National Library of Medicine (NLM) Training Grant Fellowship Program

Degree Conferred
Ph.D.
Year Graduated
Quarter Graduated
Fall
Thesis Title
Promoting Equitable Health Using Time Series Wearable Data to Explore Sex Differences and Characterize Pregnancy