Understanding the Challenges and Assisting Developers with Developing Spark Applications (ICSE 2021 - SRC - ACM Student Research Competition)

Track

ICSE 2021 SRC - ACM Student Research Competition

Time Zone

The program is currently displayed in (GMT+02:00) Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna.


When

Tue 25 May 2021 19:00 - 21:00 at SRC Room 2 - SRC Poster Session 2 Chair(s): Aurora Ramírez, Sergio Segura

Abstract

To process data more efficiently, big data frameworks provide data abstractions to developers. However, these abstractions can make it hard for developers to understand and debug their data processing code. To uncover the challenges in using big data frameworks, we first conduct an empirical study of 1,000 Apache Spark-related questions on Stack Overflow. We find that most of the challenges relate to data transformation and API usage. To address these challenges, we design an approach that assists developers with understanding and debugging data processing in Spark. Our approach leverages statistical sampling to minimize performance overhead, and provides intermediate information and hint messages for each data processing step of a chained method pipeline. A preliminary evaluation shows that the approach has low performance overhead, and it received positive feedback from developers.
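
The idea in the abstract (run each step of a chained pipeline on a small sample and report the intermediate results plus hints) can be illustrated with a minimal sketch. This is not the authors' tool and does not use Spark: it is plain Python, and the names `trace_pipeline`, `steps`, `sample_size`, and the hint wording are all hypothetical, chosen only for illustration.

```python
import random
from typing import Any, Callable, Dict, Iterable, List, Tuple

# A step is (name, kind, function); kind is "map" or "filter".
# This representation is an assumption made for this sketch.
Step = Tuple[str, str, Callable[[Any], Any]]


def trace_pipeline(
    records: Iterable[Any],
    steps: List[Step],
    sample_size: int = 3,
    seed: int = 0,
) -> List[Dict[str, Any]]:
    """Run every step on a small random sample and record what each step produced."""
    rng = random.Random(seed)
    pool = list(records)
    # Sampling keeps the tracing cost independent of the full data size.
    sample = rng.sample(pool, min(sample_size, len(pool)))
    report = [{"step": "input", "sample": list(sample), "hint": None}]

    for name, kind, fn in steps:
        before = sample
        if kind == "map":
            sample = [fn(x) for x in before]
        elif kind == "filter":
            sample = [x for x in before if fn(x)]
        else:
            raise ValueError(f"unsupported step kind: {kind}")

        hint = None
        # Hypothetical hint: a filter that drops the whole sample is suspicious.
        if kind == "filter" and before and not sample:
            hint = f"filter '{name}' removed every sampled record"
        report.append({"step": name, "sample": list(sample), "hint": hint})

    return report


if __name__ == "__main__":
    pipeline = [
        ("double", "map", lambda x: x * 2),
        ("keep_large", "filter", lambda x: x > 100),
    ]
    for entry in trace_pipeline(range(10), pipeline):
        print(entry)
```

Tracing only a sample trades completeness for low overhead: a step that misbehaves on rare records may not show up in the report.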

Link to Preprint

http://arxiv.org/abs/2103.14177

File attachments

poster (poster.pdf)	5.20MiB

Session Program

Tue 25 May
Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna

19:00 - 21:00	SRC Poster Session 2, SRC - ACM Student Research Competition, at SRC Room 2
Chair(s): Aurora Ramírez (University of Córdoba), Sergio Segura (Universidad de Sevilla)

19:00 2h Poster		NodeSRT: A Selective Regression Testing Tool for Node.js Application
	Yufeng Chen, University of British Columbia (Pre-print, Media Attached)
19:00 2h Poster		Investigating the Interplay between Developers and Automation
	Omar Elazhary, University of Victoria (Pre-print, Media Attached, File Attached)
19:00 2h Poster		WebEvo: Taming Web Application Evolution via Semantic Structure Change Detection
	Fei Shao, Case Western Reserve University (Media Attached)
19:00 2h Poster		Understanding the Challenges and Assisting Developers with Developing Spark Applications
	Zehao Wang, Concordia University, Montreal, Canada (Pre-print, Media Attached, File Attached)
19:00 2h Poster		Automation and evaluation of mutation testing for the new C++ standards
	Miguel Ángel Álvarez-García, Universidad de Cádiz (Pre-print, Media Attached)
19:00 2h Poster		ProMal: Precise Window Transition Graphs for Android via Synergy of Program Analysis and Machine Learning
	Changlin Liu, Case Western Reserve University (Media Attached)
19:00 2h Poster		Microservice-based performance problem detection in Cyber-Physical System software updates
	Aitor Gartziandia (Media Attached)
19:00 2h Poster		Please Don’t Go - Increasing Women’s Participation in Open Source Software
	Bianca Trinkenreich, Northern Arizona University (Pre-print, Media Attached)
19:00 2h Poster		Explainable Bug Prediction for Code Changes: Are We There Yet?
	Reem Aleithan, York University, Canada (Media Attached)
19:00 2h Poster		A Better Approach to Track the Evolution of Static Code Warnings
	Junjie Li (Pre-print, Media Attached, File Attached)

Zehao Wang

Concordia University, Montreal, Canada