Professor of Bioinformatics
Fellow at University College
I lived in Denmark until 1985, except for short periods in England, Austria and Italy. Then I had a very long series of Postdocs in North Carolina, California, Montreal, Japan and a few other places. From 1991 to 2001 I was a lecturer at Aarhus University and in 2001 I moved to Oxford, where I have been ever since, except for sabbatical in New Zealand, Berkeley and Chapel Hill.
Stochastic and Algorithmic Aspects of Molecular Evolution, Origins of Life and Population Genetics.
Most of my interests involve molecular evolution and molecular population genetics.
Statistical Alignment. A major project in the last 5-7 years has been the development of algorithms calculating the likelihood of a set of homologous sequences that has evolved by both insertions, deletions and substitutions. This field is also called Statistical Alignment since it can produce alignments and probability statements about the alignment. The motivation for studying, modelling and making algorithms for statistical alignment is that most sequence analysis has, in the last decade or more, benefited tremendously from the use of stochastic models of sequence evolution. This also allows parameter estimation, hypothesis testing and more.
Ancestral Recombination Graph. I and colleagues have developed a method that will find a history of a set of sequences that minimizes the number of recombinations plus substitutions and displays a history of the sequences that use this minimal number of events. This history will also find a set of intervals within which there hasn’t been any recombinations. This is a rational definition of a haplotype block. It is highly computationally demanding.
Evolutionary Models. This is a large topic and anything where you can have two examples and call them homologous in principle needs an evolutionary model. I have been interested in networks, but presently multigene families and protein structures are the focus. Multigene families have been studied for decades but only recently has it been viewed as a statistical problem.
Computational Models of Origins of Life. This is one of science’s great unsolved problems that is bound to get much more attention in the near future and turn much more computational as everything else in the biosciences.
Comments from study groups are often placed on the these facebook pages
- Science Book Discussion Club
- Classical Papers Discussion Group
- Humanities Book Club
- Jotun Hein Facebook
- Gemmell, Hein, and Katzourakis. Phylogenetic Analysis Reveals That ERVs” Die Young” but HERV-H Is Unusually Conserved. PLoS Comput Biol 12.6 (2016): e1004964.
Dialdestoro, K., Sibbesen, J.A., Maretty, L., Raghwani, J., Gall, A., Kellam, P., Pybus, O.G., Hein, J. and Jenkins, P.A., 2016. Coalescent Inference Using Serially Sampled, High-Throughput Sequencing Data from Intrahost HIV Infection. Genetics, 202(4), pp.1449-1472.
- Herman, Novák, Lyngsø, Szabó, Miklós and Hein., 2015. Efficient representation of uncertainty in multiple sequence alignments using directed acyclic graphs. BMC bioinformatics, 16(1), p.1.
- Satija, R., Novák, Á., Miklós, I., Lyngsoe, R. & Hein, J. (2009) BigFoot: Bayesian Alignment and Phylogenetic Footprinting with MCMC. BMC Evolutionary Biology 9, 217
- Lyngsø, R.B., Song, Y.S. & Hein, J. (2008) Accurate Computation of Likelihoods in the Coalescent with Recombination Via Parsimony. Lecture Notes Comput Sci 4955 (LNBI), 463–477