Computational and RNA Biology

Research

Our eleven strong research groups work in the fields of RNA biology, bioinformatics, computational biology, machine learning and population and statistical genetics, and our researchers consistently publish their studies in highly recognized international journals. The Section houses the Bioinformatics Centre and has state-of-the-art computational infrastructure and laboratory facilities.

Education

We offer a 3-year Bioinformatics Bachelor program. We also offer a 2-year Bioinformatics Master program to students, who have earned the BSc in Bioinformatics or have a background either in computer science or in molecular biology, biochemistry or biomedicine. Our students acquire thorough theoretical knowledge and hands-on experience in bioinformatics, including sequence analysis, protein and RNA structural analysis, genomics, phylogenetics, analysis of high-throughput big data and machine learning methods.

Values

We strive towards a friendly, collaborative and professional work environment that promotes excellent research and personal development.

 

BIOINFORMATICS CENTRE

Computational Biology

We are interested in the mechanisms of gene regulation and the prediction of RNA and protein structure. Our research encompasses a combination of experimental genomics methods and computational biology including promoter and enhancer analysis, and development of complex probabilistic models to predict, design and validate structure based on machine learning methods.


Thomas Hamelryck Thomas Hamelryck, Professor
KU profile page | Personal research page
We are engaged with predicting, designing and determining the 3D structure of RNA and proteins, by developing sophisticated probabilistic models that describe aspects of protein structure. These models are mainly based on machine learning methods (including dynamic Bayesian networks), and directional statistics, the statistics of angles, directions and orientations.

Albin Sandelin Albin Sandelin, Professor
KU profile page | Personal research page
The Sandelin lab is a computation/experimental group with scientists from many fields. We focus on gene regulation, transcriptomics, epigenetics and technological and informatics aspects. With the help of computers, we probe large biological datasets that are generated using novel genomics techniques. One of our strengths is our many collaborations with high-profile experimental laboratories, which supply data to be analyzed.

Robin Andersson Robin Andersson, Associate Professor
KU profile page | Personal research page
The Andersson lab aims to characterize and better understand the architectures of transcriptional regulation and the fundamental properties of enhancers and promoters. In particular, we focus on enhancer transcription and its association with regulatory activity. We take a genomics approach and use computational and statistical learning techniques to model transcriptional regulation based on large-scale sequencing data.

Ole Winther Ole Winther, Professor 
KU profile page
We have two focuses: developing machine learning and AI methods and applying them to genomic data in a clinical setting, and biological sequence analysis and medical informatics. The machine learning research is done with the jointly affiliated group at DTU Compute. Clinical genomic research is carried out in collaboration with Genomic Medicine, Rigshospitalet. An example of a current project is deep generative models for analysis of single cell RNAseq data.

Amelie Stein Amelie Stein, Associate Professor
KU profile page
Our lab studies the consequences of sequence variants on proteins, focusing on their cellular stability and function. We perform high-throughput assays on protein variants and build on this data to develop and improve methods for prediction of variant consequences. We then apply these methods to determine whether genomic variants are likely to be pathogenic. Further we aim to integrate effects of multiple mutations for applications in protein engineering.


BIOINFORMATICS CENTRE

Population and Statistical Genetics

Our group develops and applies statistical and computational methods for analysis of genomic data in diverse organisms ranging from Greenlandic populations to ruminants and African mammals. We contribute to multiple fields including human and animal disease and treatment, livestock production, migration and speciation processes and complex population analyses.


Hans Siegismund Hans Siegismund, Associate Professor
KU profile page
We work on population genetics, phylogeography and speciation processes of large African mammals, mainly bovids and great apes.  Another research area includes the study of evolutionary genetics of Foot-and-mouth disease (FMD) virus in East Africa.

Anders Albrechtsen Anders Albrechtsen, Professor
KU profile page | Personal research page
Our group develops statistical and computational methods for analysis of genomic data including methods for performing multi-loci association studies, methods for detecting and correcting for population stratification, methods for detecting natural selection, loci dependent methods for modeling identity-by-descent and various methods for analysis of second generation sequencing data.

Ida Moltke Ida Moltke, Associate Professor
KU profile page | Personal research page
We develop and apply statistical methods to genomic data with the purpose of gaining insights into human disease, history and evolution. For instance, by studying DNA from the Greenlandic population we recently identified a genetic variant that explains 10-15% of all cases of type 2 diabetes in Greenland. We have also looked into the migration history of the Artic and are currently investigating how the Greenlanders have genetically adapted the Arctic cold and their very fat-rich diet consisting mainly of seal and fish.

Rasmus Heller Rasmus Heller, Associate Professor
KU profile page | Personal research page
We study evolutionary and population genetics in wild mammals, focusing on ruminants. Our research interests include elucidating how climate, ecosystems and humans have influenced wild mammal populations in Africa, studying adaptive introgression in bovids and exploring how genomic data can be used to aid conservation. We are also involved in a ruminant genome project which aims to understand how ruminants evolved new anatomical structures, and how it has helped them become one of the most successful mammal lineages.

RNA BIOLOGY

We employ and develop molecular, genetic and biochemical approaches to study how RNA-based mechanisms regulate gene expression and cellular development. We focus on RNA structure, RNA modifications and RNA-protein interactions as well as miRNA regulation in plants.


Peter Brodersen Peter Brodersen, Professor
KU profile page | Personal research page
Our group studies the mechanisms by which small RNAs regulate gene expression. We use the flowering plant Arabidopsis thaliana as a model system, and make particular use of molecular genetic and biochemical approaches in our work.

Jan Christiansen Jan Christiansen, Associate Professor (emeritus)
KU profile page | Personal research page
We are mainly focusing on post-transcriptional events such as RNA localization, RNA stability, and translational control with an emphasis on the role of cytoplasmic RNA-binding proteins expressed during fetal life and oncogenesis.

Jeppe Vinther Jeppe Vinther, Associate Professor
KU profile page | Personal research page
In our group we aim to determine how RNA structure and RNA-protein interactions affect basal cellular processes. This knowledge is important to understand ways of improving the efficiency and specificity of RNA based drugs.

In 2002, the Faculty of Science at University of Copenhagen recognized the importance of bioinformatics research and education and Professor Anders Krogh was recruited to spearhead the efforts. Under his leadership, and with the generous support from the Novo Nordisk Foundation, the Bioinformatics Centre was founded in 2005 as an independent Centre.

The Centre, which is housed in Section for Computational and RNA Biology at the Department of Biology, employs researchers working on a wide range of topics in bioinformatics and includes faculty with shared appointments in the Department of Computer Science.

The Centre is now one of the strongest interdisciplinary research environments in Denmark. Over the years, the researchers in the Bioinformatics Centre have made numerous prominent research contributions and continue to produce world-class research results in the bioinformatics field.


Research highlights

 

The Bioinformatics Centre offers a range of free on-line services, open-access databases and open source software packages.

ANGSD
ANGSD is a software for analyzing next generation sequencing data. The software can handle a number of different input types from mapped reads to imputed genotype probabilities. Most methods take genotype uncertainty into account instead of basing the analysis on called genotypes. This is especially useful for low and medium depth data.
   
   
BARNACLE logo BARNACLE
BARNACLE is a Python library for RNA 3D structure prediction. It can be used for probabilistic sampling of RNA structures that are compatible with a given nucleotide sequence and that are RNA like on a local length scale.
   
BASILISK logo BASILISK
BASILISK is a probabilistic model of the conformational space of amino acid side chains in proteins. Unlike rotamer libraries, BASILISK models the chi angles in continuous space, including the influence of the protein's backbone.
   
  Bayesembler
The Bayesembler is a Bayesian method for doing transcriptome assembly from RNA-seq data.
   
   
BloodSpot logo BloodSpot
BloodSpot is a database that provides gene expression profiles of genes and gene signatures in healthy and malignant hematopoiesis and includes data from both humans and mice.
   
  BWA-PSSM
BWA-PSSM is a probabilistic short read aligner based on the use of position specific scoring matrices (PSSM). Like many of the existing aligners it is fast and sensitive. Unlike most other aligners, however, it also adaptible in the sense that one can direct the alignment based on known biases within the data set. It is coded as a modification of the original BWA alignment program and shares the genome index structure as well as many of the command line options.
   
   
  HemaExplorer
HemaExplorer is an easy webtool for visualization of gene expression in the hematopoetic system. The webserver takes one or two genes as query and provides plots of the expression of the gene in cells involved in haematopoiesis. Currently the database contains options for the human normal myeloid system, human AML and mouse hematopoietic system.
   
JASPAR logo JASPAR
JASPAR is the leading open-access database of matrix profiles describing the DNA-binding patterns of transcription factors and other proteins interacting with DNA in a sequence-specific manner.
   
Kaiju logo Kaiju
Kaiju is a program for the taxonomic assignment of high-throughput sequencing reads from whole-genome sequencing of metagenomic DNA.
   
   
Mocapy logo Mocapy++
Mocapy++ is a Dynamic Bayesian Network toolkit, implemented in C++. It supports discrete, multinomial, Gaussian, Kent, Von Mises and Poisson nodes. Inference and learning is done by Gibbs sampling/Stochastic-EM.
   
  Phaistos logo Phaistos
Phaistos is a collection of tools for proteins structure prediction. It currently features the FB5DBN and TorusDBN models, which make it possible to sample protein structures compatible with a given amino acid and/or secondary structure sequence.
   
Phobius logo Phobius
Phobius is a server for prediction of transmembrane topology and signal peptides from the amino acid sequence of a protein.
   
   
Saqqaq logo Saqqaq Genome Project
The primary data from the saqqaq genome project: the sequencing of an Ancient Human Genome, obtained from a permafrost-preserved hair, about 4,000 years old, of a male palaeo-Eskimo of the Saqqaq culture, the earliest known settlers in Greenland.

 

 

 

 

 

 

 

 

 

The Section has a long and proud tradition in education of Master's students. The section has recently added a Bioinformatics Bachelor program to bolster student strength, curriculum continuum and graduate excellency. Bachelor's and Master's students are taught by our prominent faculty and, at program completion earn the Bachelor of Science in Bioinformatics and the Master of Science in Bioinformatics, respectively. Two Master's tracks are available: Computational Biology and Computer Science.

  • The Computational Biology track is for students who want to add bioinformatics skills to experimental backgrounds in biology, biochemistry and biomedicine
  • The Computer Science track is for students who want to apply programming knowledge to innovation and problem-solving in biotechnology and biomedicine or related fields.

In addition, the Section contributes to the education of biology, biochemistry and biomedicine students.

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Cheng Chen
Unsupervised learning of multi-omics and phenotype data in the UK Biobank
Date: 23-07-2024

Qianna (No surname)
Study of taxonomic ambiguity of nanopore sequencing from microbial seawater communities
Date: 27-06-2024

Claes Ottzen Laurentiussen
Deconvolution of bulk transcriptomics of the liver using single-cell references to investigate cell type-specific associations with fibrosis from metabolic dysfunction
Date: 27-06-2024

Maria Elisabeth Harder
Type 2 diabetes in Greenland: prevalence and identification of risk factors
Date: 26-06-2024

Thibault Philippe Christian Bret
Exploring the Phylogenomics of Acetic Acid Bacteria: An Evolutionary Genomics Perspective
Date: 21-06-2024

Ziyi Sheng
Deep learning in CRISPR-Cas12a gRNA design
Date: 21-06-2024

Rasmus Alex Buntzen-Frederiksen
Predicting contaminated DNA samples within the class of Insecta from DNA and image embeddings
Date: 21-06-2024

Ida Anna Kups
Genome-wide investigation of the impact of epigenetic perturbations on expression of endogenous retroviruses in cancers
Date: 20-06-2024

Yan Li
Development of a deep generative model for cancer gene expression and clinical data combined
Date: 20-06-2024

Qingyang Liu
Deep learning for optimal CRISPR sgRNA design in cancer cells
Date: 20-06-2024

Yinghan Jiang
Exploring the Role of MAFLD-Associated Molecular Risk Markers and Complications along the Liver-Kidney, Liver-Heart, and Liver-Brain Axes in the UK Biobank Cohort
Date: 20-06-2024

Beatrice Hahn
Prediction of externally visible characteristics for forensic applications using UK Biobank data
Date: 19-06-2024

Abdullah Faqih Al Mubarok
Neural Network for Investigating Sequence Determinants of m6A Redistribution During Hypoxia Response
Date: 14-06-2024

Andreas Sandnes
Utilizing Differential Gene Expression Analysis Tools for DNA-Encoded Library Data Partitioning
Date: 14-06-2024

Yifan Sun
Extension and application of a side-chain customization protocol for receptor probe and drug discovery
Date: 13-06-2024

Íñigo Ríos Arroyo
Using Protein Structure Models To Predict N-Glycosylation
Date: 12-06-2024

Lucas Phillip Krieger
Exploring the Capabilities of Protein Language Models at Predicting Glycosylation
Date: 12-06-2024

Mads Harder Møller
CRISPR Base Editing Libraries Targeting Active Sites and Druggable Pockets
Date: 12-06-2024

Jiachen Yao
Computational Investigation of Anti-defense Islands in Mobile Genetic Elements
Date 11-06-2024

Wenhao Gao
Efficient detection of CRISPR/Cas9 bulged off-targets with CRISPRoff2
Date: 11-06-2024

Tobias Svend-Aage Beyer-Pedersen
Genotyping method for human repeats
Date: 06-06-2024

Kennie Lind Rothmann
ATC mapping to enhance research of prescription data for precision medicine
Date: 28-05-2024

Marta Menta Czinkoczky
Untargeted global proteomics analysis of canine Dilated Cardiomyopathy
Date: 24-04-2024

Zimeng Liu
Development of an Automated Computational Pipeline for Deriving the Protonation State of Relevant Histidines from Crystal Structures
Date: 27-02-2024

Jorge Hernansanz Biel
Validating the soft-clustering method in Type-2 diabetic cohorts with commonly-measured clinical variables used in the Danish public healthcare system
Date: 24-01-2024

Jianpeng Zheng
Synthetic data comparison from model DGD, ADS-GAN and RT-VAE
Date: 10-01-2024

Weikun Liu
Efficient Simulation of realistic Quantum Dot Arrays
Date: 29-09-2023

Yun Qin
Cross-species comparison of conserved neuron cell types important in spatial processing
Date: 14-09-2023

Yaxin Jing
Exploring the mediation of insulin resistance variants in their risk of type 2 diabetes according to their associations with adiposity and body composition
Date: 14-09-2023

JP Carrapico Rodrigues de Mendonca
Methodological Developments for Transcriptional Decomposition
Date: 06-09-2023

 

 

 

 

 

Contact

Section for Computational
and RNA Biology

Ole Maaløes Vej 5
DK-2200 Copenhagen N

SECTION HEAD
Associate Professor Jeppe Vinther
jvinther@bio.ku.dk