Building Tools and Languages for Terabyte Scale Biology: A Call to Action (CurryOn 2017 - Curry On Talks)

Track

CurryOn 2017 Curry On Talks

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 20 Jun 2017 10:25 - 11:05 at Auditorium, Vertex Building - Tuesday - 10:25 - 12:45 - Auditorium

Abstract

Biology is increasingly entering the fourth paradigm of science: tera/exabyte-scale data generation, with no single hypothesis in mind. These gigantic datasets are then searched for patterns that elucidate the biological processes that generated the measured data. The tools currently available to biologists, such as R and Python libraries, are not designed for datasets and algorithms that operate on ten thousand computer cloud clusters. Moreover, these libraries cannot be naively rewritten to leverage a distributed computing framework like Spark because these rich, high-dimensional datasets do not map well to the existing abstractions. In this talk, I’ll both describe the kinds of questions that the Biologists with massive datasets would like to ask and I’ll describe some of the tools my team is building to enable Statistical Genetics on datasets in the tens of terabytes.

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.

Use conference time zone: (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, ViennaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

Display full programSpecify a time band

Save

Session Program

Tue 20 Jun
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

10:25 - 12:45	Tuesday - 10:25 - 12:45 - AuditoriumCurry On Talks at Auditorium, Vertex Building

10:25 40m Talk		Building Tools and Languages for Terabyte Scale Biology: A Call to Action Curry On Talks Daniel King Broad Institute
11:15 40m Talk		Preventing Information Leaks by Construction Curry On Talks Jean Yang Carnegie Mellon University
12:05 40m Talk		The Sharp Edges of Leaky Abstraction Curry On Talks Mark Allen Alert Logic

Building Tools and Languages for Terabyte Scale Biology: A Call to Action

Program Display Configuration

Program Display Configuration

Tue 20 JunDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

Daniel King

Broad Institute

Tue 20 Jun
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change