* ICSE 2018 *
Sun 27 May - Sun 3 June 2018 Gothenburg, Sweden
Thu 31 May 2018 14:40 - 15:00 at E3 room - Data and Databases Chair(s): Helena Holmström Olsson

Building scalable big data programs currently requires programmers to combine relational (SQL) with non-relational code (Java, C#, Scala). Relational code is declarative — a program describes what the computation is and the compiler decides how to distribute the program. SQL query optimization has enjoyed a rich and fruitful history, however, most research and commercial optimization engines treat non-relational code as a black-box and thus are unable to optimize it. This paper empirically studies over 3 million SCOPE programs across five data centers within Microsoft and finds programs with non-relational code take between 45-70% of data center CPU time. We further explore the potential for SCOPE optimization by generating more native code from the non-relational part. Finally, we present 6 case studies showing that triggering more generation of native code in these jobs yields significant performance improvement: optimizing just one portion resulted in as much as 25% improvement for an entire program.

Slides (slides.odp)420KiB

Thu 31 May

Displayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change

14:00 - 15:30
Data and DatabasesSEIP - Software Engineering in Practice at E3 room
Chair(s): Helena Holmström Olsson Malmö University
14:00
20m
Talk
A Data Decomposition Method for Stepwise Migration of Complex Legacy Data
SEIP - Software Engineering in Practice
Andreas Martens adesso AG, Matthias Book University of Iceland, Volker Gruhn
DOI File Attached
14:20
20m
Talk
Mind The Gap: Can And Should Software Engineering Data Sharing Become A Path Of Less Resistance?
SEIP - Software Engineering in Practice
Ken Wallace BAE Systems
DOI File Attached
14:40
20m
Talk
Cross-Language Optimizations in Big Data Systems: A Case Study of SCOPE
SEIP - Software Engineering in Practice
Marija Selakovic TU Darmstadt, Germany, Michael Barnett , Madan Musuvathi Microsoft Research, Todd Mytkowicz Microsoft Research
File Attached
15:00
20m
Talk
Smelly Relations: Measuring and Understanding Database Schema Quality
SEIP - Software Engineering in Practice
Tushar Sharma Athens University of Economics and Business, Marios Fragkoulis , Stamatia Rizou , Magiel Bruntink Software Improvement Group, Diomidis Spinellis Athens University of Economics and Business
Pre-print Media Attached
15:20
10m
Talk
Q&A in this sesson
SEIP - Software Engineering in Practice