How to cite tskit-dev software
tskit
Citing tskit
If you use tskit in your work, we recommend citing the 2024 ARG Genetics paper and the 2016 msprime PLOS Computational Biology paper:
Yan Wong, Anastasia Ignatieva, Jere Koskela, Gregor Gorjanc, Anthony W Wohns, Jerome Kelleher, A general and efficient representation of ancestral recombination graphs, Genetics, Volume 228, Issue 1, September 2024, iyae100, https://doi.org/10.1093/genetics/iyae100
Jerome Kelleher, Alison M Etheridge and Gilean McVean (2016), Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes, PLOS Comput Biol 12(5): e1004842. doi: 10.1371/journal.pcbi.1004842
If you use summary statistics, please cite the 2020 Genetics paper:
Peter Ralph, Kevin Thornton, Jerome Kelleher, Efficiently Summarizing Relationships in Large Samples: A General Duality Between Statistics of Genealogies and Genomes, Genetics, Volume 215, Issue 3, 1 July 2020, Pages 779–797, https://doi.org/10.1534/genetics.120.303253
BibTeX records:
@article{Wong2024ARGs,
author = {Wong, Yan and Ignatieva, Anastasia and Koskela, Jere and Gorjanc, Gregor and
Wohns, Anthony W and Kelleher, Jerome},
title = {A general and efficient representation of ancestral recombination graphs},
journal = {Genetics},
volume = {228},
number = {1},
pages = {iyae100},
year = {2024},
doi = {10.1093/genetics/iyae100}
}
@article{Kelleher2016msprime,
author = {Kelleher, Jerome and Etheridge, Alison M and McVean, Gilean},
title = {Efficient coalescent simulation and genealogical analysis for large sample sizes},
journal = {PLoS Computational Biology},
volume = {12},
number = {5},
pages = {e1004842},
year = {2016},
publisher = {Public Library of Science}
}
@article{Ralph2020Stats,
author = {Ralph, Peter and Thornton, Kevin and Kelleher, Jerome},
title = {Efficiently Summarizing Relationships in Large Samples: A General Duality Between Statistics of Genealogies and Genomes},
journal = {Genetics},
volume = {215},
number = {3},
pages = {779--797},
year = {2020},
doi = {10.1534/genetics.120.303253}
}
Citation details for tskit can be found at: https://tskit.dev/tskit/docs/stable/citation.html
msprime
Citing msprime
If you use msprime in your work, please cite the 2022 Genetics paper marking the 1.0 release:
Franz Baumdicker, Gertjan Bisschop, Daniel Goldstein, Graham Gower, Aaron P Ragsdale, Georgia Tsambos, Sha Zhu, Bjarki Eldon, E Castedo Ellerman, Jared G Galloway, Ariella L Gladstein, Gregor Gorjanc, Bing Guo, Ben Jeffery, Warren W Kretzschmar, Konrad Lohse, Michael Matschiner, Dominic Nelson, Nathaniel S Pope, Consuelo D Quinto-Cortés, Murillo F Rodrigues, Kumar Saunack, Thibaut Sellinger, Kevin Thornton, Hugo van Kemenade, Anthony W Wohns, Yan Wong, Simon Gravel, Andrew D Kern, Jere Koskela, Peter L Ralph and Jerome Kelleher (2022), Efficient ancestry and mutation simulation with msprime 1.0, Genetics, Volume 220, Issue 3. http://doi.org/10.1093/genetics/iyab229
You may also wish to cite the original 2016 PLOS Computational Biology paper:
Jerome Kelleher, Alison M Etheridge and Gilean McVean (2016), Efficient Coalescent Simulation and Genealogical Analysis for Large Sample Sizes, PLOS Comput Biol 12(5): e1004842. doi: 10.1371/journal.pcbi.1004842
If you use the Discrete Time Wright-Fisher model, please cite the 2020 PLOS Genetics paper:
Dominic Nelson, Jerome Kelleher, Aaron P. Ragsdale, Claudia Moreau, Gil McVean and Simon Gravel (2020), Accounting for long-range correlations in genome-wide simulations of large cohorts, PLOS Genetics 16(5): e1008619. https://doi.org/10.1371/journal.pgen.1008619
BibTeX records:
@article{baumdicker2022efficient,
title={Efficient ancestry and mutation simulation with msprime 1.0},
author = {Baumdicker, Franz and Bisschop, Gertjan and Goldstein, Daniel
and Gower, Graham and Ragsdale, Aaron P and Tsambos, Georgia and Zhu, Sha
and Eldon, Bjarki and Ellerman, E Castedo and Galloway, Jared G
and Gladstein, Ariella L and Gorjanc, Gregor and Guo, Bing
and Jeffery, Ben and Kretzschmar, Warren W and Lohse, Konrad
and Matschiner, Michael and Nelson, Dominic and Pope, Nathaniel S
and Quinto-Cortés, Consuelo D and Rodrigues, Murillo F
and Saunack, Kumar and Sellinger, Thibaut and Thornton, Kevin
and van Kemenade, Hugo and Wohns, Anthony W and Wong, Yan
and Gravel, Simon and Kern, Andrew D and Koskela, Jere
and Ralph, Peter L and Kelleher, Jerome},
journal={Genetics},
volume={220},
number={3},
pages={iyab229},
year={2022},
publisher={Oxford University Press}
}
@article{kelleher2016efficient,
title={Efficient coalescent simulation and genealogical analysis for large sample sizes},
author={Kelleher, Jerome and Etheridge, Alison M and McVean, Gilean},
journal={PLoS computational biology},
volume={12},
number={5},
pages={e1004842},
year={2016},
publisher={Public Library of Science}
}
@article{nelson2020accounting,
title={Accounting for long-range correlations in genome-wide simulations of large cohorts},
author={Nelson, Dominic and Kelleher, Jerome and Ragsdale, Aaron P and
Moreau, Claudia and McVean, Gil and Gravel, Simon},
journal={PLoS genetics},
volume={16},
number={5},
pages={e1008619},
year={2020},
publisher={Public Library of Science}
}
Citation details for msprime can be found at: https://tskit.dev/msprime/docs/stable/CITATION.html
tsinfer
Citing tsinfer
If you use tsinfer in your work, please cite the 2019 Nature Genetics paper:
Jerome Kelleher, Yan Wong, Anthony W. Wohns, Chaimaa Fadil, Patrick K. Albers & Gil McVean (2019) Inferring whole-genome histories in large population datasets, Nature Genetics, Volume 51, 1330–1338. https://doi.org/10.1038/s41588-019-0483-y
BibTeX record:
@article{Kelleher2019,
doi = {10.1038/s41588-019-0483-y},
url = {https://doi.org/10.1038/s41588-019-0483-y},
year = {2019},
month = sep,
publisher = {Springer Science and Business Media {LLC}},
volume = {51},
number = {9},
pages = {1330--1338},
author = {Jerome Kelleher and Yan Wong and Anthony W. Wohns and Chaimaa Fadil and Patrick K. Albers and Gil McVean},
title = {Inferring whole-genome histories in large population datasets},
journal = {Nature Genetics}
}
Citation details for tsinfer can be found at: https://tskit.dev/tsinfer/docs/stable/CITATION.html
tsdate
Citing tsdate
The algorithm for the inside_outside and maximization methods is described in our Science paper (citation below; a preprint is also available). A separate repository provides code to reproduce evaluations of the accuracy and computational requirements of these methods.
The default variational_gamma method has not yet been described in print. For the moment, please cite the tsdate GitHub repository if you need a citable reference.
The original tsdate algorithm, which you should cite in published work, is published in:
Anthony Wilder Wohns, Yan Wong, Ben Jeffery, Ali Akbari, Swapan Mallick, Ron Pinhasi, Nick Patterson, David Reich, Jerome Kelleher, and Gil McVean (2022) A unified genealogy of modern and ancient genomes. Science 375: eabi8264; doi: https://doi.org/10.1126/science.abi8264
Citation details for tsdate can be found at: https://tskit.dev/tsdate/docs/stable/#citing
pyslim
Citation details for pyslim can be found at: https://zenodo.org/records/8205346
tsbrowse
Citation details for tsbrowse can be found at: https://doi.org/10.1101/2025.04.23.649987
tstrait
Citing tstrait
If you use tstrait in your research project, please cite the following paper:
Daiki Tagami, Gertjan Bisschop, and Jerome Kelleher (2024), tstrait: a quantitative trait simulator for ancestral recombination graphs, Bioinformatics, Volume 40, Issue 6. https://doi.org/10.1093/bioinformatics/btae334
BibTeX record:
@article{10.1093/bioinformatics/btae334,
author = {Tagami, Daiki and Bisschop, Gertjan and Kelleher, Jerome},
title = "{tstrait: a quantitative trait simulator for ancestral recombination graphs}",
journal = {Bioinformatics},
volume = {40},
number = {6},
pages = {btae334},
year = {2024},
month = {05},
doi = {10.1093/bioinformatics/btae334},
}
Citation details for tstrait can be found at: https://tskit.dev/tstrait/docs/stable/citation.html