This chapter gives a minimalistic, combinatorial introduction to molecular biology, omitting the description of most biochemical processes and focusing on inputs and outputs, abstracted as mathematical objects.
This chapter connects the alignment techniques and space-efficient data structures covered in earlier chapters. It shows how to use BWT indexes for aligning sequencing reads to a reference genome. This powerful read-mapping procedure enables variant calling and genotyping of new individuals from a species whose reference genome has already been assembled.
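As a small illustration of the primitive behind BWT-based read mapping, the sketch below counts exact occurrences of a pattern with backward search over a naively built Burrows–Wheeler transform. The function names are illustrative only, and real read mappers replace the quadratic construction and linear-scan rank used here with suffix-array construction and compressed rank structures.

```python
# Minimal backward search over the Burrows-Wheeler transform (illustrative
# sketch: quadratic BWT construction and linear-time rank; real read mappers
# use suffix-array construction and compressed rank structures instead).

def bwt(text):
    text += "$"                                   # unique terminator, lexicographically smallest
    rotations = sorted(text[i:] + text[:i] for i in range(len(text)))
    return "".join(rot[-1] for rot in rotations)

def backward_search(bwt_str, pattern):
    """Count the exact occurrences of pattern in the indexed text."""
    # C[c] = number of symbols in the text that are strictly smaller than c
    C, total = {}, 0
    for c in sorted(set(bwt_str)):
        C[c] = total
        total += bwt_str.count(c)
    def rank(c, i):                               # occurrences of c in bwt_str[:i]
        return bwt_str[:i].count(c)
    lo, hi = 0, len(bwt_str)                      # current suffix-array interval
    for c in reversed(pattern):                   # extend the match one symbol at a time
        if c not in C:
            return 0
        lo = C[c] + rank(c, lo)
        hi = C[c] + rank(c, hi)
        if lo >= hi:
            return 0
    return hi - lo

print(backward_search(bwt("ACGTACGT"), "ACG"))    # -> 2
```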
An alignment of two sequences aims to highlight how much the two sequences have in common. In computational biology, an alignment is a prediction of the evolutionary steps between the two sequences. Different costs can be assigned to such steps, and one can then seek an optimal alignment. This chapter gives a comprehensive introduction to the dynamic programming algorithms developed for various alignment formulations.
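The simplest instance of such a dynamic program is unit-cost edit distance, where each substitution, insertion, or deletion costs 1; the sketch below uses that cost scheme purely for illustration, while the chapter treats more general scoring and gap models.

```python
# Unit-cost edit distance via dynamic programming (an illustrative instance
# of the alignment recurrences; the chapter treats general cost schemes,
# gap models and local/semi-local variants).

def edit_distance(a, b):
    m, n = len(a), len(b)
    # dp[i][j] = minimum number of substitutions, insertions and deletions
    # needed to transform a[:i] into b[:j]
    dp = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        dp[i][0] = i
    for j in range(n + 1):
        dp[0][j] = j
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            dp[i][j] = min(
                dp[i - 1][j - 1] + (a[i - 1] != b[j - 1]),  # match / mismatch
                dp[i - 1][j] + 1,                           # delete a[i-1]
                dp[i][j - 1] + 1,                           # insert b[j-1]
            )
    return dp[m][n]

print(edit_distance("ACGTTA", "AGTTTA"))   # -> 2
```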
This chapter shows how to analyze and compare genomes without assuming that a reference genome is available. The bidirectional BWT index turns out to be essential here, and the chapter covers a comprehensive set of techniques for manipulating this data structure. The algorithms covered include computing maximal exact/unique matches, substring kernels, matching statistics, and Jaccard similarity.
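As a toy illustration of the last of these measures, the sketch below computes the Jaccard similarity of two sequences directly from explicit k-mer sets; the chapter's point is that the same quantity can be obtained space-efficiently with the bidirectional BWT index instead.

```python
# Jaccard similarity over explicit k-mer sets (illustrative; the chapter
# obtains the same quantity space-efficiently with the bidirectional
# BWT index instead of hash sets).

def kmers(seq, k):
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def jaccard(seq_a, seq_b, k):
    A, B = kmers(seq_a, k), kmers(seq_b, k)
    if not A and not B:
        return 1.0
    return len(A & B) / len(A | B)

print(jaccard("ACGTACGT", "ACGTTCGT", k=3))   # 2 shared 3-mers out of 7 distinct ones
```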
Several large-scale studies aim to build comprehensive catalogs of all the variants in a population, for example all the frequent variants in a species or all the variants in a group of individuals with a specific trait or disease. Such catalogs are the substrate for subsequent genome-wide association studies that aim to correlate variants with traits and, ultimately, to enable personalized treatments. The catalogs can also be leveraged to carry out basic analysis tasks, such as read alignment, against not just one reference genome but a pangenome data structure representing all the genomes in the catalog. The chapter gives an overview of different pangenome data structures and their applications. Selected data structures, including the r-index, are covered in more depth.
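A minimal sketch of the premise behind the r-index: the Burrows–Wheeler transform of a repetitive collection tends to have far fewer equal-letter runs than characters, so structures whose size depends on the run count r stay small. The toy collection below is an assumption made only for illustration.

```python
# The r-index premise in miniature: the Burrows-Wheeler transform of a
# repetitive collection has few equal-letter runs, so structures whose
# size is proportional to the number of runs r remain small.

from itertools import groupby

def bwt(text):
    text += "$"
    rotations = sorted(text[i:] + text[:i] for i in range(len(text)))
    return "".join(rot[-1] for rot in rotations)

def bwt_runs(text):
    transformed = bwt(text)
    runs = sum(1 for _ in groupby(transformed))   # count maximal equal-letter runs
    return runs, len(transformed)

# a toy "collection": three copies of (almost) the same genome, concatenated
collection = "ACGTACGTAA" + "ACGTACGTAA" + "ACGTACGTCA"
print(bwt_runs(collection))   # (number of BWT runs, length of the BWT)
```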
Assume that a drop of seawater contains cells from many distinct species. Sequencing such a mixed sample and figuring out the relative abundance of each species is a key problem in metagenomics. This chapter explores techniques for metagenomic analysis in different settings, for example with and without assuming that reference sequences are available. To solve these problems, we use techniques including tailored k-mer-based analyses, bidirectional BWT indexing, and network flows.
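As a deliberately naive illustration of the reference-based setting, the sketch below assigns each read to the reference sharing the most k-mers with it and reports relative frequencies; the species names, sequences, and reads are made up, and the chapter's methods handle sequencing errors, ambiguous reads, and the reference-free setting far more carefully.

```python
# Toy reference-based abundance estimation: assign each read to the
# reference species sharing the most k-mers with it, then report relative
# frequencies. (Illustrative only; real methods must handle sequencing
# errors, ambiguous reads and the reference-free setting.)

from collections import Counter

def kmers(seq, k):
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}

def relative_abundance(references, reads, k=4):
    ref_kmers = {name: kmers(seq, k) for name, seq in references.items()}
    counts = Counter()
    for read in reads:
        rk = kmers(read, k)
        # pick the reference with the largest k-mer overlap
        best = max(ref_kmers, key=lambda name: len(rk & ref_kmers[name]))
        counts[best] += 1
    total = sum(counts.values())
    return {name: counts[name] / total for name in references}

refs = {"speciesA": "ACGTACGTACGT", "speciesB": "TTGCAATTGCAA"}
reads = ["ACGTACG", "TTGCAAT", "GTACGTA", "GCAATTG", "ACGTACGT"]
print(relative_abundance(refs, reads))   # -> {'speciesA': 0.6, 'speciesB': 0.4}
```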
This chapter presents the minimal setup of data structures required to follow the rest of the book in a self-contained manner. Balanced binary trees are enhanced to solve dynamic range minimum queries. Bitvector rank and select data structures, and their extension to larger alphabets via the wavelet tree, are covered. Then a special structure for solving static range minimum queries is derived. The chapter ends with a concise description of hashing primitives, such as perfect hashing, Bloom filters, minimizers, and the Rabin–Karp rolling hash.
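As an example of the last of these primitives, the sketch below computes Rabin–Karp rolling hashes of all k-mers of a DNA string in linear time; the base and modulus are arbitrary choices made for illustration, not values prescribed by the book.

```python
# Rabin-Karp rolling hash over a DNA string (illustrative sketch; the base
# and modulus below are arbitrary choices).

from collections import defaultdict

BASE = 4
MOD = (1 << 61) - 1                      # a large Mersenne prime modulus
CODE = {"A": 0, "C": 1, "G": 2, "T": 3}

def rolling_hashes(seq, k):
    """Yield (position, hash) for every k-mer of seq in O(len(seq)) total time."""
    h = 0
    top = pow(BASE, k - 1, MOD)                     # weight of the symbol leaving the window
    for i, c in enumerate(seq):
        if i >= k:
            h = (h - CODE[seq[i - k]] * top) % MOD  # drop the outgoing symbol
        h = (h * BASE + CODE[c]) % MOD              # append the incoming symbol
        if i >= k - 1:
            yield i - k + 1, h

# usage: group positions by hash to spot (probable) repeated 3-mers
positions = defaultdict(list)
for pos, h in rolling_hashes("ACGTACGTA", 3):
    positions[h].append(pos)
print([p for p in positions.values() if len(p) > 1])   # -> [[0, 4], [1, 5], [2, 6]]
```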
Throughout the book we mostly assume the genome sequence under study to be known. In this chapter we look at strategies for assembling fragments of DNA into longer contiguous blocks, and eventually into chromosomes. The chapter is partitioned into sections roughly following the workflow of a de novo assembly project, namely error correction, contig assembly, scaffolding, and gap filling. Algorithms working with de Bruijn graphs and overlap graphs are studied.
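A minimal sketch of the central data structure in contig assembly: a node-centric de Bruijn graph whose nodes are (k-1)-mers and whose edges are the k-mers observed in the reads. The reads below are toy inputs chosen only for illustration.

```python
# Node-centric de Bruijn graph built from a set of reads (illustrative
# sketch: nodes are (k-1)-mers, edges are the k-mers observed in the reads).

from collections import defaultdict

def de_bruijn_graph(reads, k):
    graph = defaultdict(set)                 # (k-1)-mer -> set of successor (k-1)-mers
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])   # edge from the k-mer's prefix to its suffix
    return graph

# toy reads; contig assembly then looks for unbranching paths in this graph
reads = ["ACGTAC", "CGTACG"]
for node, succs in sorted(de_bruijn_graph(reads, 3).items()):
    print(node, "->", sorted(succs))
```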
In this chapter we show that many optimization problems can be reduced to a network flow problem. This polynomially solvable problem is a powerful model, which has found a remarkable array of applications. Roughly stated, in a network flow problem, one is given a transportation network and is required to find the optimal way of sending some content through this network. The chapter covers basic primitives around a general flow formulation called the minimum-cost flow.
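The toy instance below illustrates what a minimum-cost flow computation looks like, using the third-party networkx library purely for convenience (an assumption of this sketch; the chapter develops the underlying algorithms themselves): four units must be shipped from s to t at minimum total cost while respecting edge capacities.

```python
# A tiny minimum-cost flow instance solved with the third-party networkx
# library (used here only for illustration; the chapter develops the
# underlying algorithms themselves).

import networkx as nx

G = nx.DiGraph()
G.add_node("s", demand=-4)   # negative demand = supply: s must ship 4 units
G.add_node("t", demand=4)    # t must receive 4 units
G.add_edge("s", "a", capacity=3, weight=1)
G.add_edge("s", "b", capacity=3, weight=2)
G.add_edge("a", "t", capacity=3, weight=1)
G.add_edge("b", "t", capacity=3, weight=1)

flow = nx.min_cost_flow(G)       # flow[u][v] = units sent along edge (u, v)
print(flow)                      # routes 3 units via the cheaper path s-a-t, 1 via s-b-t
print(nx.cost_of_flow(G, flow))  # total cost 3*(1+1) + 1*(2+1) = 9
```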
This chapter gives an introduction to complexity analysis, data representations, and reductions. In addition, the Knuth–Morris–Pratt algorithm is covered to give a taste of dynamic programming – a technique introduced in Chapters 4 and 6 and used extensively thereafter.
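The sketch below shows the Knuth–Morris–Pratt algorithm, whose failure-function computation reuses previously computed values in the spirit of dynamic programming.

```python
# Knuth-Morris-Pratt exact pattern matching (illustrative sketch).

def kmp_failure(pattern):
    """fail[i] = length of the longest proper prefix of pattern[:i+1]
    that is also a suffix of it (its longest border)."""
    fail = [0] * len(pattern)
    k = 0
    for i in range(1, len(pattern)):
        while k > 0 and pattern[i] != pattern[k]:
            k = fail[k - 1]              # fall back to a shorter border
        if pattern[i] == pattern[k]:
            k += 1
        fail[i] = k
    return fail

def kmp_search(text, pattern):
    """Report all starting positions of pattern in text in O(|text| + |pattern|) time."""
    fail, k, hits = kmp_failure(pattern), 0, []
    for i, c in enumerate(text):
        while k > 0 and c != pattern[k]:
            k = fail[k - 1]
        if c == pattern[k]:
            k += 1
        if k == len(pattern):            # full match ending at position i
            hits.append(i - k + 1)
            k = fail[k - 1]
    return hits

print(kmp_search("ACGTACGTACG", "ACGTACG"))   # -> [0, 4]
```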
A pragmatic problem arising in the analysis of biological sequences is that collections of genomes, and especially collections of read sets consisting of material from many species, occupy too much space. This chapter explores techniques to efficiently compress such collections. Several algorithms related to Lempel–Ziv factorization are covered, as well as the prefix-free parsing technique to run-length encode the Burrows–Wheeler transform of a collection of genomes.
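As an illustration of the first of these ideas, the sketch below computes a greedy Lempel–Ziv-style factorization naively in quadratic time; the chapter covers far more efficient, index-based constructions.

```python
# Naive greedy Lempel-Ziv-style factorization (illustrative; quadratic time,
# whereas the chapter covers much more efficient, index-based constructions).

def lz_factorize(text):
    """Split text into phrases, each being the longest prefix of the remaining
    suffix that already occurs starting at an earlier position (or a single
    new character if there is no such occurrence)."""
    phrases, i = [], 0
    while i < len(text):
        length = 0
        # extend while text[i:i+length+1] has an occurrence starting before i
        while i + length < len(text) and text.find(text[i:i + length + 1], 0, i + length) != -1:
            length += 1
        if length == 0:
            phrases.append(text[i])          # brand-new character
            i += 1
        else:
            phrases.append(text[i:i + length])
            i += length
    return phrases

print(lz_factorize("ACACACGTACACACGT"))   # -> ['A', 'C', 'ACAC', 'G', 'T', 'ACACACGT']
```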