Creating and Analyzing Source Code Repository Models - A Model-based Approach to Mining Software Repositories

Markus Scheidgen, Martin Smidt, Joachim Fischer

2017

Abstract

With mining software repositories (MSR), we analyze the rich data created during the whole evolution of one or more software projects. One major obstacle in MSR is the heterogeneity and complexity of source code as a data source. With model-based technology in general and reverse engineering in particular, we can use abstraction to overcome this obstacle. But, this raises a new question: can we apply existing reverse engineering frameworks that were designed to create models from a single revision of a software system to analyze all revisions of such a system at once? This paper presents a framework that uses a combination of EMF, the reverse engineering framework Modisco, a NoSQL-based model persistence framework, and OCL-like expressions to create and analyze fully resolved AST-level model representations of whole source code repositories. We evaluated the feasibility of this approach with a series of experiments on the Eclipse code-base.

Download


Paper Citation


in Harvard Style

Scheidgen M., Smidt M. and Fischer J. (2017). Creating and Analyzing Source Code Repository Models - A Model-based Approach to Mining Software Repositories . In Proceedings of the 5th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD, ISBN 978-989-758-210-3, pages 329-336. DOI: 10.5220/0006127303290336

in Bibtex Style

@conference{modelsward17,
author={Markus Scheidgen and Martin Smidt and Joachim Fischer},
title={Creating and Analyzing Source Code Repository Models - A Model-based Approach to Mining Software Repositories},
booktitle={Proceedings of the 5th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD,},
year={2017},
pages={329-336},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0006127303290336},
isbn={978-989-758-210-3},
}


in EndNote Style

TY - CONF
JO - Proceedings of the 5th International Conference on Model-Driven Engineering and Software Development - Volume 1: MODELSWARD,
TI - Creating and Analyzing Source Code Repository Models - A Model-based Approach to Mining Software Repositories
SN - 978-989-758-210-3
AU - Scheidgen M.
AU - Smidt M.
AU - Fischer J.
PY - 2017
SP - 329
EP - 336
DO - 10.5220/0006127303290336