HiCOMB 2018

In conjunction with the IEEE International Parallel and Distributed Processing Symposium

Quick Links

Keynote and invited abstracts

James Taylor, Computing Chromosome Conformation

Gene regulation - control of when, where, and at what level genes are expressed - is a fundamental part of cell development and identity. Gene regulation involves complex coordination of DNA architecture at multiple scales, from the individual DNA bases to the organization of whole chromosomes. Chromosomes in the eukaryotic nucleus are organized in a coordinated non-random configuration that has a substantial influence on the regulation of gene expression, and thus cell state and identity. Achieving a global full understanding of gene regulation requires a multi-scale understanding of the function of the genome in its developmental and structural context. In recent years, our ability to understand this organization has substantially increased due to a variety of high-throughput assays. Chromatin interactions can be interrogated globally using high-throughput sequencing based approaches including Hi-C in both populations of cells and more recently single cells. Localization in single-cells can also be interrogated using fluorescence imaging approaches that are increasingly high-resolution and high-throughput. Here the I will discuss the computational challenges in analyzing and integrating these data types, and the resulting insights into our current understanding of how chromatin is organized. In addition, I will describe recent advances in software tools and infrastructure that help to facilitate the analyses of large-scale biological datasets. Onur Mutlu, Accelerating Genome Analysis: A Primer on an Ongoing Journey

Benjamin Langmead, Practical lessons from scaling read aligners to hundreds of threads

General-purpose processors can now contain many dozens of processor cores and support hundreds of simultaneous threads of execution. To make best use of these threads, genomics software must contend with new and subtle computer architecture issues. I will discuss tradeoffs related to lock types, input parsing strategies, batching, output striping and multiprocessing versus multithreading. I will also explore how the FASTQ file format -- its unpredictable record boundaries in particular -- can impede thread scaling. I'll suggest simple ways to change FASTQ files and similar formats that enable further improvements in thread scaling while maintaining essentially the same compressed file size. Finally, I will show how these improvements affect performance of the popular Bowtie, Bowtie 2 and HISAT alignment tools across various general-purpose architectures including Intel Skylake and Knight's Landing. Inanc Birol, Multi-index Bloom filters and spaced seeds for sequence mapping

Benedict Paten, Mapping to a population

The human reference genome is part of the foundation of modern human biology, and a monumental scientific achievement. However, because it excludes a great deal of common human variation, it introduces a pervasive reference bias into the field of human genomics. To reduce this bias, it makes sense to draw on representative collections of human genomes, brought together into reference cohorts. There are a number of techniques to represent and organize data gleaned from these cohorts, many using ideas implicitly or explicitly borrowed from graph based models. Here I survey our progress in this domain, and show how genome graphs, associated data structures and population genomics models can be used to efficiently map sequencing reads not to one genome, but simultaneously to the haplotypes of thousands of genomes.

HiCOMB 2018 Call For Papers

The size and complexity of genome- and proteome-scale data sets in bioinformatics continues to grow at a furious pace, and the analysis of these complex, noisy, data sets demands efficient algorithms and high performance computer architectures. Hence high-performance computing has become an integral part of research and development in bioinformatics, computational biology, and medical and health informatics. The goal of this workshop is to provide a forum for discussion of latest research in developing high-performance computing solutions to data- and compute-intensive problems arising from all areas of computational life sciences. We are especially interested in parallel and distributed algorithms, memory-efficient algorithms, large scale data mining techniques including approaches for big data and cloud computing, algorithms on multicores, many-cores and GPUs, and design of high-performance software and hardware for biological applications.

The workshop will feature contributed papers as well as invited talks from reputed researchers in the field.

Topics of interest include but are not limited to:

Bioinformatics data analytics
Biological network analysis
Cloud-enabled solutions for computational biology
Computational genomics and metagenomics
Computational proteomics and metaproteomics
DNA assembly, clustering, and mapping
Energy-aware high performance biological applications
Gene identification and annotation
High performance algorithms for computational systems biology
High throughput, high dimensional data analysis: flow cytometry and related proteomic data
Parallel algorithms for biological sequence analysis
Molecular evolution and phylogenetic reconstruction algorithms
Protein structure prediction and modeling
Parallel algorithms in chemical genetics and chemical informatics
Transcriptome analysis with RNASeq

Submission guidelines

To submit a paper, please upload a PDF file through Easy Chair at the HiCOMB 2018 Submission Site. Submitted manuscripts may not exceed ten (10) single-spaced double-column pages using a 10-point size font on 8.5x11 inch pages (IEEE conference style), including figures, tables, and references (see IPDPS Call for Papers for more details). All papers will be reviewed. Proceedings of the workshops will be distributed at the conference and are submitted for inclusion in the IEEE Explore Digital Library after the conference.

Important Dates

Workshop submissions due:	~~January 30, 2018~~ February 6th, 2018
Author notification:	February 26, 2018
Final Camera-ready papers due:	March 15, 2018
Workshop:	May 21, 2018

Keynote Speakers

James Taylor
Ralph S. O'Connor Associate Professor of Biology
Associate Professor of Computer Science
Johns Hopkins University

Onur Mutlu
Professor of Computer Science
ETH Zurich

Program Committee

Ariful Azad, Lawrence Berkeley Lab
Rayan Chikhi, CNRS, University of Lille 1
Faraz Hach, Simon Fraser University
Niina S. Haiminen, IBM
Fereydoun Hormozdiari, UC Davis
Ananth Kalyanaraman, Washington State University
Daisuke Kihara, Purdue University
Mehmet Koyuturk, Case Western Reserve University
Benjamin Langmead, Johns Hopkins University
Kamesh Madduri, Penn State
Paul Medvedev (Chair), Penn State
Alba Cristina Magalhaes Alves de Melo, University of Brasilia
Folker Meyer, Argonne National Lab
Rob Patro, Stony Brook University
Knut Reinert, Freie Universit?t Berlin
Jan Schroeder, The Walter and Eliza Hall Institute of Medical Research
Alexandros Stamatakis, Heidelberg Institute for Theoretical Studies
Sharma Thankachan, Georgia Tech
Jaroslaw Zola, University at Buffalo, SUNY

Workshop Organizer

Paul Medvedev
Penn State University
Email: pz*m*11@psu.edu (remove the stars)

Steering Committee Members

David A. Bader
College of Computing
Georgia Institute of Technology
Email:
Srinivas Aluru
College of Computing
Georgia Institute of Technology
Email:

HiCOMB Archive

16th	International Workshop on High Performance Computational Biology - HiCOMB 2017
15th	International Workshop on High Performance Computational Biology - HiCOMB 2016
14th	International Workshop on High Performance Computational Biology - HiCOMB 2015
13th	International Workshop on High Performance Computational Biology - HiCOMB 2014
12th	International Workshop on High Performance Computational Biology - HiCOMB 2013
11th	International Workshop on High Performance Computational Biology - HiCOMB 2012
10th	International Workshop on High Performance Computational Biology - HiCOMB 2011
9th	International Workshop on High Performance Computational Biology - HiCOMB 2010
8th	International Workshop on High Performance Computational Biology - HiCOMB 2009
7th	International Workshop on High Performance Computational Biology - HiCOMB 2008
6th	International Workshop on High Performance Computational Biology - HiCOMB 2007
5th	International Workshop on High Performance Computational Biology - HiCOMB 2006
4th	International Workshop on High Performance Computational Biology - HiCOMB 2005
3rd	International Workshop on High Performance Computational Biology - HiCOMB 2004
2nd	International Workshop on High Performance Computational Biology - HiCOMB 2003
1st	International Workshop on High Performance Computational Biology - HiCOMB 2002