Tuesday, May 16
8-10am, Workshop: W1
Machine Learning as a Tool for Biomedical Research
Anna Sowa, Ph.D., Senior Scientific Consultant
Jordan Ramsdell, Scientific Consultant
Machine learning (ML) has emerged as a powerful tool for biomedical research in recent years. With the ability to analyze vast amounts of complex data, ML can help researchers uncover patterns and relationships, allowing scientists to leverage these insights for downstream decisions. In this workshop, we will explore real data and discuss how to make it accessible to ML algorithms and translate the results into meaningful discoveries for researchers. Attendees will be introduced to MLDevOps approaches for handling data and integrating them into a research workflow.
10:30am-12:30pm, Workshop: W6
Using Amazon Omics for Genomics End-to-End: From Raw Sequence to Querying Variants
William Van Etten, Ph.D., Senior Scientific Consultant
Take a hands-on tour of Amazon Omics and learn how to use it to build a complete end-to-end genomics data journey. You will learn how to store petabytes of raw genomics data at low cost, process data efficiently and at scale with secondary analysis workflows, and query population-scale variant datasets. Only a laptop and an internet connection are required to participate.
Wednesday, May 17
The Future of Data Science in Biomedicine: New Approaches to Make FAIR Data a Reality
Ari Berman, Ph.D., CEO
Data science in life sciences and biomedical research has surged forward in the last few decades, and the community has seriously considered new data hygiene approaches to reach a more universal state of FAIR (Findable, Accessible, Interoperable, and Reusable) data. Unfortunately, FAIR data remains an elusive goal. While there are pockets of excellence and portions of the greater scientific community are aligning on data strategies, governance, hygiene, and common formats, the community writ large is still quite resistant to making the changes necessary to reach those goals. In this session, a core group of experts from across the field will explore which practical technologies, methods, and cultural changes might lead the field to accomplish our FAIR goals sooner rather than later.
Effective Use of AWS ParallelCluster for Life Science Workloads
Chris Dagdigian, Co-Founder and Senior Technical Director of Infrastructure
This presentation will cover the tips and tricks we use to deploy AWS ParallelCluster in various ’omics and computational chemistry (CompChem) environments.
Data Product Management: The Key to Building Product-Centric Organizations
Stan Gloss, Co-Founder and Fellow
According to McKinsey, 80% of Digital Transformation projects fail to meet their objectives. The root cause of this problem is that there is no Digital Transformation without Data Transformation. Many organizations view their data as exhaust or a byproduct that needs to be managed instead of as the fuel that drives insightful decision-making. This talk will explore how organizations like Recursion Pharmaceuticals are moving from a project mindset to a product mindset and implementing data product management to drive change from the ground up. It will also discuss current trends and the role data product management plays in monetizing data by developing the data products that stakeholders want.
Thursday, May 18
ACE in Bioinformatics and Data-Intensive Sciences
Ari Berman, Ph.D., CEO
Laura Boykin, Ph.D., Senior Scientific Consultant
Shane Corder, Senior Scientific Consultant
This session will highlight global collaborative efforts, ongoing since 2015, with the African Centers of Excellence (ACE) in bioinformatics and data-intensive sciences. Speakers will give an update on this project and discuss the work to build the technology and infrastructure foundations for sustainable energy systems, as well as the constraints involved in achieving global growth.
2:40pm, Room 210
Plenary Closing Keynote: Trends from the Trenches
• Chris Dagdigian, Co-Founder and Senior Technical Director of Infrastructure
• Adam Kraut, Senior Director, Marketing and Technical Consultant
• Anna Sowa, Ph.D., Senior Scientific Consultant
Since 2010, the “Trends from the Trenches” presentation, given by Chris Dagdigian, has been one of the most popular annual traditions of the Bio-IT Program. The intent of the talk is to deliver a candid (and occasionally blunt) assessment of the best, the most worthwhile, and the most overhyped information technologies (IT) for life sciences. The presentation has helped scientists, leadership, and IT professionals understand the basic topics related to computing, storage, data transfer, networks, cloud, data science, and machine learning that are involved in supporting data-intensive science. In 2023, Chris will give the “Trends from the Trenches” presentation in its original “state-of-the-state address” format, followed by guest speakers giving podium talks on relevant topics. A moderated, interactive Q&A discussion with the audience will follow. Come prepared with your questions and commentary for this informative and lively session.