Replicability Study: Corpora For Understanding Simulink Models & Projects | IEEE Conference Publication | IEEE Xplore


Abstract:

Background: Empirical studies on widely used model-based development tools such as MATLAB/Simulink are limited despite the tools' importance in various industries. Aims: This paper investigates the reproducibility of previous empirical studies that used Simulink model corpora and evaluates how well their results generalize to a newer and larger corpus, including a comparison with proprietary models. Method: The study reviews the methodologies and data sources employed in prior Simulink model studies and replicates the previous analyses using SLNET. In addition, we propose a heuristic for identifying code-generating Simulink models and assess the open-source models' similarity to proprietary models. Results: Our analysis of SLNET confirms some earlier findings and contradicts others, and it highlights SLNET's potential as a valuable resource for model-based development research. We found that open-source Simulink models follow good modeling practices and that the corpus contains models comparable in size and properties to proprietary models. We also collected and distribute 208 Git repositories with over 9k commits, facilitating studies on model evolution. Conclusions: The replication study offers actionable insights and lessons learned from the reproduction process, including valuable information on the generalizability of research findings based on earlier open-source corpora to the newer and larger SLNET corpus. The study sheds light on noteworthy attributes of SLNET, which is self-contained and redistributable.
Date of Conference: 26-27 October 2023
Date Added to IEEE Xplore: 08 November 2023
Conference Location: New Orleans, LA, USA

I. Introduction

There are only a few empirical studies of open-source MATLAB/Simulink artifacts, possibly because of a widespread perception that open-source Simulink artifacts are typically small, do not represent closed-source development, and are often hard to acquire [14], [15], [27], [61], [70]. Most empirical Simulink studies to date have instead relied on academic-industry collaborations to gain access to large closed-source Simulink artifacts [6]. Most empirical results on Simulink development and artifacts are thus based on case studies of closed-source artifacts, which are hard to reproduce or replicate even when the studies provide detailed experimental design descriptions and measurement tools [10].

1.
B. Adhikari, E. J. Rapos and M. Stephan, "Simulink model transformation for backwards version compatibility", MODELS-C, pp. 427-436, Oct. 2021.
2.
S. Ajami, Y. Woodbridge and D. G. Feitelson, "Syntax, predicates, idioms - what really affects code complexity?", Empir. Softw. Eng., vol. 24, no. 1, pp. 287-328, Feb. 2019.
3.
H. Altinger, S. Siegl, Y. Dajsuren and F. Wotawa, "A novel industry grade dataset for fault prediction based on model-driven developed automotive embedded software", MSR, pp. 494-497, May 2015.
4.
B. Balasubramaniam, H. Bagheri, S. Elbaum and J. Bradley, "Investigating controller evolution and divergence through mining and mutation", ICCPS, pp. 151-161, 2020.
5.
M. M. Barón, M. Wyrich and S. Wagner, "An empirical validation of cognitive complexity as a measure of source code understandability", ESEM, pp. 5:1-5:12, Oct. 2020.
6.
V. Bertram, S. Maoz, J. O. Ringert, B. Rumpe and M. Von Wenckstern, "Component and connector views in practice: An experience report", MODELS, pp. 167-177, September 2017.
7.
A. Boll, F. Brokhausen, T. Amorim, T. Kehrer and A. Vogelsang, "Characteristics, potentials, and limitations of open-source Simulink projects for empirical research", Software and Systems Modeling, vol. 20, pp. 2111-2130, Apr. 2021.
8.
A. Boll, T. Kehrer, A. Vogelsang, T. Amorim and F. Brokhausen, Characteristics, potentials, and limitations of open source Simulink projects for empirical research: Dataset, 2021, [online] Available: https://doi.org/10.6084/m9.figshare.13636589.v2.
9.
A. Boll, Characteristics, potentials, and limitations of open source Simulink projects for empirical research: Dataset, 2021, [online] Available: https://doi.org/10.6084/m9.figshare.13636589.v1.
10.
A. Boll, N. Vieregg and T. Kehrer, "Replicability of experimental tool evaluations in model-based software and systems engineering with MATLAB/Simulink", Innovations in Systems and Software Engineering, pp. 1-16, Mar. 2022.
11.
H. Bourbouh, P.-L. Garoche, T. Loquen, E. Noulard and C. Pagetti, "CoCoSim, a code generation framework for control/command applications: An overview of CoCoSim for multi-periodic discrete Simulink models", ERTS, 2020.
12.
D. T. Campbell and J. C. Stanley, Experimental and Quasi-Experimental Designs for Research, Houghton Mifflin Company, pp. 3, 1963.
13.
G. A. Campbell, "Cognitive complexity: An overview and evaluation", TechDebt, pp. 57-58, May 2018.
14.
S. A. Chowdhury, S. Mohian, S. Mehra, S. Gawsane, T. T. Johnson and C. Csallner, "Automatically finding bugs in a commercial cyber-physical system development tool chain with SLforge", ICSE, pp. 981-992, May 2018.
15.
S. A. Chowdhury, S. L. Shrestha, T. T. Johnson and C. Csallner, "SLEMI: Equivalence modulo input (EMI) based mutation of CPS models for finding compiler bugs in Simulink", ICSE, pp. 335-346, May 2020.
16.
S. A. Chowdhury, "SLEMI: Finding Simulink compiler bugs through equivalence modulo input (EMI)", Proc. 42nd International Conference on Software Engineering (ICSE) Companion Volume, pp. 1-4, May 2020.
17.
S. A. Chowdhury, L. S. Varghese, S. Mohian, T. T. Johnson and C. Csallner, "A curated corpus of Simulink models for model-based empirical studies", SEsCPS, pp. 45-48, May 2018.
18.
S. A. Chowdhury, R. Holmes, A. Zaidman and R. Kazman, "Revisiting the debate: Are code metrics useful for measuring maintenance effort?", Empir. Softw. Eng., vol. 27, no. 6, Nov. 2022.
19.
Y. Dajsuren, M. G. J. Van den Brand, A. Serebrenik and S. A. Roubtsov, "Simulink models are also software: Modularity assessment", QoSA, pp. 99-106, Jun. 2013.
20.
dSPACE, TargetLink, 2023, [online] Available: https://www.dspace.com/en/inc/home/products/sw/pcgs/targetlink.cfm.
21.
D. G. Feitelson, "Considerations and pitfalls for reducing threats to the validity of controlled experiments on code comprehension", Empir. Softw. Eng., vol. 27, no. 6, Nov. 2022.
22.
S. W. Flint, J. Chauhan and R. Dyer, "Pitfalls and guidelines for using time-based Git data", Empir. Softw. Eng., vol. 27, no. 7, pp. 1-55, Dec. 2022.
23.
W. Hu, T. Loeffler and J. Wegener, "Quality model based on ISO/IEC 9126 for internal quality of MATLAB/Simulink/Stateflow models", ICIT, pp. 325-330, 2012.
24.
A. Hwang, Video surveillance system design with Simulink and Xilinx FPGAs, 2022, [online] Available: https://www.mathworks.com/matlabcentral/fileexchange/20160-video-surveillance-system-design-with-simulink-and-xilinx-fpgas.
25.
M. Jaskolka, S. Scott, V. Pantelic, A. Wassyng and M. Lawford, "Applying modular decomposition in Simulink", ISSRE-W, 2020.
26.
G. Jay, J. E. Hale, R. K. Smith, D. P. Hale, N. A. Kraft and C. Ward, "Cyclomatic complexity and lines of code: Empirical evidence of a stable linear relationship", J. Softw. Eng. Appl., vol. 2, no. 3, pp. 137-143, 2009.
27.
Z. Jiang, X. Wu, Z. Dong and M. Mu, "Optimal test case generation for Simulink models using slicing", QRS-C, pp. 363-369, July 2017.
28.
E. Kalliamvakou, G. Gousios, K. Blincoe, L. Singer, D. M. Germán and D. E. Damian, "An in-depth study of the promises and perils of mining GitHub", Empir. Softw. Eng., vol. 21, no. 5, pp. 2035-2071, 2016.
29.
J. Krizan, L. Ertl, M. Bradac, M. Jasansky and A. Andreev, "Automatic code generation from MATLAB/Simulink for critical applications", CCECE, pp. 1-6, May 2014.
30.
D. Landman, A. Serebrenik, E. Bouwers and J. J. Vinju, "Empirical analysis of the relationship between CC and SLOC in a large corpus of Java methods and C functions", J. Softw. Evol. Process, vol. 28, no. 7, pp. 589-618, Jul. 2016.
