Training deep learning models on personalized genomic sequences improves variant effect prediction
Published in BioRxiv, 2024
Sequence-to-function models have broad applications in interpreting the molecular impact of genetic variation, yet have been criticized for poor performance in this task. Here we show that training models on functional genomic data with matched personal genomes improves their performance at variant effect prediction. Variant effect representations are retained even when fine tuning models to unseen cellular contexts and experimental readouts. Our results have implications for interpreting trait-associated genetic variation.
Recommended citation: A. Y. He, N. P. Palamuttam, and C. G. Danko. Training deep learning models on personalized genomic sequences improves variant effect prediction. BioRxiv. doi.org/10.1101/2024.10.15.618510
Download Paper