MSR 2014- Proceedings of the 11th Working Conference on Mining Software Repositories
Full Citation in the ACM Digital Library
SESSION: Keynote
Is mining software repositories data science? (keynote)
Audris Mockus
SESSION: Green Mining
Mining energy-greedy API usage patterns in Android apps: an empirical study
Mario Linares-Vásquez
Gabriele Bavota
Carlos Bernal-Cárdenas
Rocco Oliveto
Massimiliano Di Penta
Denys Poshyvanyk
GreenMiner: a hardware based mining software repositories software energy consumption framework
Abram Hindle
Alex Wilson
Kent Rasmussen
E. Jed Barlow
Joshua Charles Campbell
Stephen Romansky
Mining questions about software energy consumption
Gustavo Pinto
Fernando Castor
Yu David Liu
SESSION: Code Clones and Origin Analysis
Prediction and ranking of co-change candidates for clones
Manishankar Mondal
Chanchal K. Roy
Kevin A. Schneider
Incremental origin analysis of source code files
Daniela Steidl
Benjamin Hummel
Elmar Juergens
Oops! where did that code snippet come from?
Lisong Guo
Julia Lawall
Gilles Muller
SESSION: Bug Characterizing
Works for me! characterizing non-reproducible bug reports
Mona Erfani Joorabchi
Mehdi Mirzaaghaei
Ali Mesbah
Characterizing and predicting blocking bugs in open source projects
Harold Valdivia Garcia
Emad Shihab
An empirical study of dormant bugs
Tse-Hsun Chen
Meiyappan Nagappan
Emad Shihab
Ahmed E. Hassan
SESSION: Mining Repos and QA Sites
The promises and perils of mining GitHub
Eirini Kalliamvakou
Georgios Gousios
Kelly Blincoe
Leif Singer
Daniel M. German
Daniela Damian
Mining StackOverflow to turn the IDE into a self-confident programming prompter
Luca Ponzanelli
Gabriele Bavota
Massimiliano Di Penta
Rocco Oliveto
Michele Lanza
Mining questions asked by web developers
Kartik Bajaj
Karthik Pattabiraman
Ali Mesbah
Process mining multiple repositories for software defect resolution from control and organizational perspective
Monika Gupta
Ashish Sureka
Srinivas Padmanabhuni
SESSION: Mining Applications
MUX: algorithm selection for software model checkers
Varun Tulsian
Aditya Kanade
Rahul Kumar
Akash Lal
Aditya V. Nori
Improving the effectiveness of test suite through mining historical data
Jeff Anderson
Saeed Salem
Hyunsook Do
Finding patterns in static analysis alerts: improving actionable alert ranking
Quinn Hanam
Lin Tan
Reid Holmes
Patrick Lam
Impact analysis of change requests on source code based on interaction and commit histories
Motahareh Bahrami Zanjani
George Swartzendruber
Huzefa Kagdi
SESSION: Defect Prediction
An empirical study of just-in-time defect prediction using cross-project models
Takafumi Fukushima
Yasutaka Kamei
Shane McIntosh
Kazuhiro Yamashita
Naoyasu Ubayashi
Towards building a universal defect prediction model
Feng Zhang
Audris Mockus
Iman Keivanloo
Ying Zou
SESSION: Code Review and Code Search
The impact of code review coverage and code review participation on software quality: a case study of the qt, VTK, and ITK projects
Shane McIntosh
Yasutaka Kamei
Bram Adams
Ahmed E. Hassan
Modern code reviews in open-source projects: which problems do they fix?
Moritz Beller
Alberto Bacchelli
Andy Zaidman
Elmar Juergens
Thesaurus-based automatic query expansion for interface-driven code search
Otávio A. L. Lemos
Adriano C. de Paula
Felipe C. Zanichelli
Cristina V. Lopes
SESSION: Effort Estimation and Reuse
Estimating development effort in Free/Open source software projects by mining software repositories: a case study of OpenStack
Gregorio Robles
Jesús M. González-Barahona
Carlos Cervigón
Andrea Capiluppi
Daniel Izquierdo-Cortázar
An industrial case study of automatically identifying performance regression-causes
Thanh H. D. Nguyen
Meiyappan Nagappan
Ahmed E. Hassan
Mohamed Nasser
Parminder Flora
Revisiting Android reuse studies in the context of code obfuscation and library usages
Mario Linares-Vásquez
Andrew Holtzhauer
Carlos Bernal-Cárdenas
Denys Poshyvanyk
SESSION: Mining Mix
Syntax errors just aren't natural: improving error reporting with language models
Joshua Charles Campbell
Abram Hindle
José Nelson Amaral
Do developers feel emotions? an exploratory analysis of emotions in software artifacts
Alessandro Murgia
Parastou Tourani
Bram Adams
Marco Ortu
How does a typical tutorial for mobile development look like?
Rebecca Tiarks
Walid Maalej
Unsupervised discovery of intentional process models from event logs
Ghazaleh Khodabandelou
Charlotte Hug
Rebecca Deneckère
Camille Salinesi
SESSION: Short Research/Practice Papers
Tracing dynamic features in python programs
Beatrice Åkerblom
Jonathan Stendahl
Mattias Tumlin
Tobias Wrigstad
It's not a bug, it's a feature: does misclassification affect bug localization?
Pavneet Singh Kochhar
Tien-Duy B. Le
David Lo
Classifying unstructured data into natural language text and technical information
Thorsten Merten
Bastian Mager
Simone Bürsner
Barbara Paech
Collaboration in open-source projects: myth or reality?
Yuriy Tymchuk
Andrea Mocci
Michele Lanza
Improving the accuracy of duplicate bug report detection using textual similarity measures
Alina Lazar
Sarah Ritchey
Bonita Sharif
Undocumented and unchecked: exceptions that spell trouble
Maria Kechagia
Diomidis Spinellis
Innovation diffusion in open source software: preliminary analysis of dependency changes in the gentoo portage package database
Remco Bloemen
Chintan Amrit
Stefan Kuhlmann
Gonzalo Ordóñez–Matamoros
A dictionary to translate change tasks to source code
Katja Kevic
Thomas Fritz
New features for duplicate bug detection
Nathan Klein
Christopher S. Corley
Nicholas A. Kraft
Mining modern repositories with elasticsearch
Oleksii Kononenko
Olga Baysal
Reid Holmes
Michael W. Godfrey
SESSION: Mining Challenge
A study of external community contribution to open-source projects on GitHub
Rohan Padhye
Senthil Mani
Vibha Singhal Sinha
Understanding "watchers" on GitHub
Jyoti Sheoran
Kelly Blincoe
Eirini Kalliamvakou
Daniela Damian
Jordan Ell
Do developers discuss design?
João Brunet
Gail C. Murphy
Ricardo Terra
Jorge Figueiredo
Dalton Serey
Magnet or sticky? an OSS project-by-project typology
Kazuhiro Yamashita
Shane McIntosh
Yasutaka Kamei
Naoyasu Ubayashi
Security and emotion: sentiment analysis of security discussions on GitHub
Daniel Pletea
Bogdan Vasilescu
Alexander Serebrenik
Sentiment analysis of commit comments in GitHub: an empirical study
Emitza Guzman
David Azócar
Yang Li
Analysing the 'biodiversity' of open source ecosystems: the GitHub case
Nicholas Matragkas
James R. Williams
Dimitris S. Kolovos
Richard F. Paige
Co-evolution of project documentation and popularity within github
Karan Aggarwal
Abram Hindle
Eleni Stroulia
An insight into the pull requests of GitHub
Mohammad Masudur Rahman
Chanchal K. Roy
SESSION: Data Showcase
A dataset for pull-based development research
Georgios Gousios
Andy Zaidman
The bug catalog of the maven ecosystem
Dimitris Mitropoulos
Vassilios Karakoidas
Panos Louridas
Georgios Gousios
Diomidis Spinellis
A dataset of feature additions and feature removals from the Linux kernel
Leonardo Passos
Krzysztof Czarnecki
Kataribe: a hosting service of historage repositories
Kenji Fujiwara
Hideaki Hata
Erina Makihara
Yusuke Fujihara
Naoki Nakayama
Hajimu Iida
Kenichi Matsumoto
Lean GHTorrent: GitHub data on demand
Georgios Gousios
Bogdan Vasilescu
Alexander Serebrenik
Andy Zaidman
A code clone oracle
Daniel E. Krutz
Wei Le
Generating duplicate bug datasets
Alina Lazar
Sarah Ritchey
Bonita Sharif
FLOSS 2013: a survey dataset about free software contributors: challenges for curating, sharing, and combining
Gregorio Robles
Laura Arjona Reina
Alexander Serebrenik
Bogdan Vasilescu
Jesús M. González-Barahona
A green miner's dataset: mining the impact of software change on energy consumption
Chenlei Zhang
Abram Hindle
Gentoo package dependencies over time
Remco Bloemen
Chintan Amrit
Stefan Kuhlmann
Gonzalo Ordóñez–Matamoros
Models of OSS project meta-information: a dataset of three forges
James R. Williams
Davide Di Ruscio
Nicholas Matragkas
Juri Di Rocco
Dimitris S. Kolovos
A dataset of clone references with gaps
Hiroaki Murakami
Yoshiki Higo
Shinji Kusumoto
A dataset for maven artifacts and bug patterns found in them
Vaibhav Saini
Hitesh Sajnani
Joel Ossher
Cristina V. Lopes
OpenHub: a scalable architecture for the analysis of software quality attributes
Gabriel Farah
Juan Sebastian Tejada
Dario Correal
Understanding software evolution: the maisqual ant data set
Boris Baldassari
Philippe Preux