Establishing pangenome graph as a framework for the analysis of the impact of structural variation on gene expression

dc.contributor.advisorGolicz, Agnieszka
dc.contributor.advisorSnowdon, Rod
dc.contributor.advisorFrei, Michael
dc.contributor.advisorClifton-Brown, John
dc.contributor.authorYildiz, Gözde
dc.date.accessioned2026-03-20T12:42:53Z
dc.date.issued2025
dc.description.abstractThis thesis addresses key challenges and opportunities in structural variation (SV) detection, genotyping, and downstream analyses using pangenome variation graphs for Brassica napus. By integrating different sequencing technologies and available bioinformatics tools, it demonstrates strategies to optimize the analysis of complex plant genomes, which are typically large, repetitive, and polyploid. Popular mapping and SV calling pipelines were evaluated using both simulated and real datasets from major crops, including rapeseed, tomato, maize, and soybean, across low to medium sequencing depths. The results demonstrate the feasibility of cost-effective SV detection, identifying the most efficient aligners and callers that achieve robust performance even at low coverage (≥5×). These findings provide a practical framework for population-scale crop studies, where sequencing costs and coverages are often a limiting factor. A graph-based pangenome approach was developed by combining long-read SV discovery with pangenome reference, allowing comparison with existing references to assess and reduce reference bias. This strategy enabled the identification of SVs, the construction of graph-based pangenomes, and subsequent SV genotyping in larger populations using short-read data, eliminating the need for costly de novo assemblies. The approach is therefore scalable, accessible, and suitable for high-throughput crop genomics. Importantly, integration with gene expression data revealed that many SVs, particularly those linked to transposable elements, significantly affect gene regulation and may underlie key agronomic traits. Overall, this thesis demonstrates that SVs are not only a major source of genetic diversity but also critical drivers of gene regulatory variation in crops. By providing benchmarking guidelines, novel graph-based pipelines, and functional insights into SVs, it lays the foundation for incorporating structural variation into future genome-informed breeding and trait discovery, ultimately supporting the development of more resilient and productive crop varieties.
dc.identifier.urihttps://jlupub.ub.uni-giessen.de/handle/jlupub/21418
dc.identifier.urihttps://doi.org/10.22029/jlupub-20765
dc.language.isoen
dc.relation.hasparthttps://doi.org/10.1079/9781789247848.0008
dc.relation.hasparthttps://doi.org/10.1002/tpg2.20314
dc.relation.hasparthttps://doi.org/10.1007/s00122-025-04867-2
dc.rightsIn Copyright
dc.rights.urihttp://rightsstatements.org/page/InC/1.0/
dc.subject.ddcddc:580
dc.subject.ddcddc:630
dc.titleEstablishing pangenome graph as a framework for the analysis of the impact of structural variation on gene expression
dc.typedoctoralThesis
dcterms.dateAccepted2025-12-08
local.affiliationFB 09 - Agrarwissenschaften, Ökotrophologie und Umweltmanagement
thesis.levelthesis.doctoral

Dateien

Originalbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
YildizGözde-2026-03-16.pdf
Größe:
9.93 MB
Format:
Adobe Portable Document Format

Lizenzbündel

Gerade angezeigt 1 - 1 von 1
Lade...
Vorschaubild
Name:
license.txt
Größe:
7.58 KB
Format:
Item-specific license agreed upon to submission
Beschreibung: