21st IEEE International Workshop on High Performance Computational Biology
May 30, 2022
Virtual

   In conjunction with the IEEE International Parallel and Distributed Processing Symposium

Announcements:


Confirmed Keynote and Invited Speakers

Christina
Christina Boucher [Keynote Speaker] 
Associate Professor, Department of Computer and Information Science
University of Florida, Gainesville, FL, USA

Title: Building scalable indexes that can be efficiently queried

Abstract:
    Recently, Gagie et al. proposed a version of the FM-index, called the r-index, that can store thousands of human genomes on a commodity computer. We later showed how to build the r-index efficiently via a technique called prefix-free parsing (PFP) and demonstrated its effectiveness for exact pattern matching. Exact pattern matching can be leveraged to support approximate pattern matching but the r-index itself cannot support efficiently popular and important queries such as finding maximal exact matches (MEMs). To address this shortcoming, Bannai et al. introduced the concept of thresholds, and showed that storing them together with the r-index enables efficient MEM finding --- but they did not say how to find those thresholds. We present another novel algorithm that applies PFP to build the r-index and find the thresholds simultaneously and in linear time and space with respect to the size of the prefix-free parse. Our implementation can rapidly find MEMs between reads and large sequence collections of highly repetitive sequences. Compared to existing methods, ours used 2 to 11 times less memory and was 2 to 32 times faster for index construction. Moreover, our method was less than one thousandth the size of competing indexes for large collections of human chromosomes.

Biography:
Dr. Boucher is an Associate Professor in the Department of Computer and Information Science and Engineering at the University of Florida. She has over 85 publications in bioinformatics, with over two dozen of them in succinct data structures and/or alignment. Considering this, she was a keynote speaker for IABD 2019, FAB 2018, RECOMB-SEQ 2016 and the ECCB 2016 Workshop on Pan-Genomics. She is a recipient of an ESA 2016 Best Paper Award. She oversees the development and maintenance of several software methods, including MEGARes and AMRPlusPlus, METAMarc, Kohdista, Vari, VariMerge — and most recently, Moni. In addition, she has built a team of collaborators in various biomedical sciences including microbiology, veterinarian medicine, epidemiology, public health, and clinical sciences. In addition, she actively works on increasing the diversity in bioinformatics education. Her efforts include being a member of the University of Florida’s Implicit Bias committee, being a panelist for the NSF-funded ACM BCB 2015 Women in Bioinformatics meeting, serving as a faculty advisor for an ACM-W chapter, and being an active member of the Diversity Committee for over three years. She also received a fellowship from The Institute for Learning and Teaching (TILT) for her course redevelopment and served on the advisory committee for an NSF Research Traineeships Program. She has been the PC chair for several conferences, including SPIRE 2020, RECOMB-Seq 2019, and ACM-BCB 2018. Most recently, she was nominated to serve on the NIH BDMA Study Section as a Standing Member, and a Board Member of SIG BIO.
Yatish
Yatish Turakhia  [Invited Speaker] 
Assistant Professor, Department of Electrical and Computer Engineering
University of California San Diego,  CA, USA

Title: Pandemic-scale phylogenetics

Abstract:
    Phylogenetics has been central to the genomic surveillance, epidemiology and contact tracing efforts during the COVD-19 pandemic. But the massive scale of genomic sequencing has rendered the pre-pandemic tools quite inadequate for comprehensive phylogenetic analyses. In this talk, I will discuss a high-performance computing (HPC) phylogenetic package that we developed to address the needs imposed by this pandemic. Orders of magnitude gains were achieved by this package through several domain-specific optimization and parallelization techniques. The package comprises four programs: UShER, matOptimize, RIPPLES and matUtils. Using high-performance computing, UShER and matOptimize maintain and refine daily a massive mutation-annotated phylogenetic tree consisting of all (>9M currently) SARS-CoV-2 sequences available on online repositories. With UShER and RIPPLES, individual labs – even with modest compute resources – incorporate newly-sequenced SARS-CoV-2 genomes on this phylogeny and discover evidence for recombination in real-time. With matUtils, they rapidly query and visualize massive SARS-CoV-2 phylogenies. This has empowered scientists worldwide to study the SARS-CoV-2 evolutionary and transmission dynamics at an unprecedented scale, resolution and speed. This has laid the groundwork for future genomic surveillance of MOST infectious pathogens.

Biography:
Dr. Turakhia is an Assistant Professor in the Department of Electrical and Computer Engineering at the University of California San Diego since July 2021. He is also affiliated with the Department of Computer Science and Engineering and the Bioinformatics and Systems Biology graduate program at UCSD. His lab is also affiliated with the Center of Machine-Integrated Computing and Security and Center of Microbiome Innovation at UCSD. Previously, he was a postdoctoral scholar at the Genomics Institute at UC Santa Cruz. Dr. Turakhia obtained his Ph.D. in Electrical Engineering in 2019 from Stanford University and his bachelor’s and master’s degrees in Electrical Engineering from the Indian Institute of Technology (IIT), Bombay in 2014.
        HiCOMB 2022 Call For Papers

The size and complexity of genomic and biomedical big data continue to grow at a furious pace, and the analysis of these complex, noisy, data sets demands efficient algorithms and high performance computing architectures. Hence, high-performance computing (HPC) has become an integral part of research and development in bioinformatics, computational biology, and medical and health informatics. The goal of the HiCOMB workshop is to showcase novel HPC research and technologies to solve data- and compute-intensive problems arising from all areas of computational life sciences. The workshop will feature contributed papers as well as invited talks from reputed researchers in the field.

For peer-reviewed papers, we invite authors to submit original and previously unpublished work that are at the intersection of the "pillars" of modern day computational life sciences and HPC.  More specifically, we encourage submissions from all areas of biology that can benefit from HPC, and from all areas of HPC that need new development to address the class of computational problems that originate from biology.

Areas of interest within computational life sciences include (but not limited to):

Areas of interest within HPC include (but are not limited to):

Submission guidelines

To submit a paper, please upload a PDF file through the Linklings HiCOMB 2022 submission link:
https://ssl.linklings.net/conferences/ipdps/?page=Submit&id=HiCOMBWorkshopFullSubmission&site=ipdps2022

IPDPS workshops can have submission in three categories: regular papers (up to 10 pages), short papers (up to 4 pages), and extended abstracts (1 page). Submitted manuscripts may not exceed ten (10) single-spaced double-column pages using a 10-point size font on 8.5x11 inch pages (IEEE conference style), including figures, tables, and references (see IPDPS Call for Papers for more details). All papers will be reviewed by three or more referees. This year, the authors of the accepted papers will be given a choice on whether to have the paper appear in the IPDPSW Proceedings (which will be digitally indexed and archived as part of the IEEE Xplore Digital Library). If the authors choose not to make it part of the proceedings, then the paper will not be considered archival. In either case, all accepted papers will be posted online on the workshop website, and all accepted papers (archived or not) will need to have an oral presentation at the workshop by one of the authors of the paper.


Important Dates

Workshop submission deadline
(for all categories):

January 22, 2022  February 5, 2022, 11:59pm AoE (extended deadline)
Author notification: February 28, 2022
Final camera-ready papers deadline: March 15, 2022 
Workshop: May 30, 2022

Program Chair

Program Committee

General Chairs

Steering Committee Members


HiCOMB Archive

20th International Workshop on High Performance Computational Biology - HiCOMB 2021
19th International Workshop on High Performance Computational Biology - HiCOMB 2020
18th International Workshop on High Performance Computational Biology - HiCOMB 2019
17th International Workshop on High Performance Computational Biology - HiCOMB 2018
16th International Workshop on High Performance Computational Biology - HiCOMB 2017
15th International Workshop on High Performance Computational Biology - HiCOMB 2016
14th International Workshop on High Performance Computational Biology - HiCOMB 2015
13th International Workshop on High Performance Computational Biology - HiCOMB 2014
12th International Workshop on High Performance Computational Biology - HiCOMB 2013
11th International Workshop on High Performance Computational Biology - HiCOMB 2012
10th International Workshop on High Performance Computational Biology - HiCOMB 2011
9th International Workshop on High Performance Computational Biology - HiCOMB 2010
8th International Workshop on High Performance Computational Biology - HiCOMB 2009
7th International Workshop on High Performance Computational Biology - HiCOMB 2008
6th International Workshop on High Performance Computational Biology - HiCOMB 2007
5th International Workshop on High Performance Computational Biology - HiCOMB 2006
4th International Workshop on High Performance Computational Biology - HiCOMB 2005
3rd International Workshop on High Performance Computational Biology - HiCOMB 2004
2nd International Workshop on High Performance Computational Biology - HiCOMB 2003
1st International Workshop on High Performance Computational Biology - HiCOMB 2002