Empirical study of the relationship between design patterns and code smells

Mahmoud Alfadel; Khalid Aljasser; Mohammad Alshayeb

doi:10.1371/journal.pone.0231731

. 2020 Apr 16;15(4):e0231731. doi: 10.1371/journal.pone.0231731

Empirical study of the relationship between design patterns and code smells

Mahmoud Alfadel ¹, Khalid Aljasser ², Mohammad Alshayeb ^2,^*

Editor: Jacopo Soldani³

PMCID: PMC7162509 PMID: 32298360

Abstract

Software systems are often developed in such a way that good practices in the object-oriented paradigm are not met, causing the occurrence of specific disharmonies which are sometimes called code smells. Design patterns catalogue best practices for developing object-oriented software systems. Although code smells and design patterns are widely divergent, there might be a co-occurrence relation between them. The objective of this paper is to empirically evaluate if the presence of design patterns is related to the presence of code smells at different granularity levels. We performed an empirical study using 20 design patterns and 13 code smells in ten small-size to medium-size, open source Java-based systems. We applied statistical analysis and association rules. Results confirm that classes participating in design patterns have less smell-proneness and smell frequency than classes not participating in design patterns. We also noticed that every design pattern category act in the same way in terms of smell-proneness in the subject systems. However, we observed, based on the association rules learning and the proposed validation technique, that some patterns may be associated with certain smells in some cases. For instance, Command patterns can co-occur with God Class, Blob and External Duplication smell.

1. Introduction

Design patterns (DP), are recurring solutions to common software design problems. DPs aim to improve reusability and reduce coupling [1]. DPs can ease communication among team members by using terminology instead of using traditional explanations [2]. Gamma et al. [3] identified 23 design patterns which can be divided into three categories: creational, structural and behavioral. Creational design patterns are those that deal with creating objects so that the created objects serve a purpose that is suitable to the situation. Structural design patterns are design techniques that facilitate software design by identifying simple ways to realize relationships among the different entities. Behavioral design patterns are communication patterns. Design patterns or design motifs refer to several classes of different roles. Each design motif plays role in source code [4]. DPs have an impact on software quality attributes such as maintainability, reusability and fault-proneness [3, 5]. Design patterns may affect software quality attributes positively or negatively [1, 2]. Design patterns may degrade the system performance and make code unnecessarily complex when they are not used in a proper manner [1]. Different studies reported different results for the impact of design patterns on software quality. Vokáč [6] found that participant design pattern classes have fewer defects. Additionally, Prechelt et al. [1] reported that design patterns can positively affect maintenance aspects. On the other hand, other studies [7–10] reported negative impact of design pattern on software quality. Therefore, the question about design patterns impact on software quality is still open due to such inconclusiveness.

Code smells are problems that appear in a fragment of code that make software hard to maintain and change [11]. In 1999, Fowler defined code smells as signs of design problems that can hamper software maintenance [11]. However, code smells, unlike design pattern, are not patterns but they are signs where developers can look for a concrete issue. Therefore, code smells can suggest places in code required to be examined more thoroughly. Similar to design patterns, researchers reported different results about the impact of code smells on various quality aspects. Li and Shatnawi [12] found that code smells and defect density are correlated. However, studies in [13–15] have shown that smells are useful indicators to explain maintenance issues. Several factors such as software domain and experience of programmer have been used to detect code smells [16, 17]. Hence, it is important for researchers to come up with other mechanisms to detect bad smells. Therefore, expanding other factors could help practitioners in better understanding bad smells relation with code.

Some recent works studied design patterns and code smells in different ways. The majority of them discussed the topic from refactoring points. For instance, studies [18–20] proposed tools that suggest fragments need refactoring. Other studies [21, 22] investigated the relation between design patterns and code smells in terms of their structural relationship. Furthermore, Wendorff [9] reported cases where design patterns may have negative impact on attributes like maintainability which may lead to introduce bad smells. McNatt and Bieman [23] reported that some design patterns such as Command, Proxy, Bridge and Observer may degrade the system performance and make code uncontrolled when they are not used in a proper manner.

Design patterns are associated with good software design and code while bad smells indicate a lack of design or code flaws. DPs and bad smells represent antagonistic structures; therefore, they are rarely investigated in the same research context. Considering the conflicting results of the impact of DPs on quality attributes and having ambiguous relations that relate code smells with maintainability aspects due to design patterns applications; the connection between DPs and code smells with special focus on their co-occurrences deserves more investigation. Moreover, only a few studies have investigated the direct relationship between DPs and code smells [24]. Thus, more studies are needed to rigorously investigate and contrast a generalized evidence of the impact of design patterns on code smells. Also, we explore the potential relationship between code smells and DPs at different granularity levels.

Descriptions of some patterns could reveal potential associations with smells. For example, the Strategy pattern creates a set of objects in classes that represent algorithms. However, data which might be needed by the objects is decoupled. Hence, such situation could introduce Feature Envy smell in the classes that hold data. The Factory Method pattern has a similar situation of the Strategy pattern. When a method in Factory Method pattern creates objects from other classes, it sends a set of parameters to set up an instance. Hence, the pattern could result in an association with Long Parameter Lists smell and Blob smell. Such hypotheses motivated us more to conduct this study to explore if there is a potential relationship between design patterns and code smells. In this study, we initiate an assessment of the relationship between both concepts using Java language.

The study of the relationship between design patterns and code smells can uncover other factors that could help code reviewers to have better understanding of code quality. The analysis of the relation between design patterns and code smells can provide software designers with the knowledge and guidance to employ design patterns properly. Understanding the relationship can provide code reviewers with indications about the occurrence of smells over the code. For example, knowing patterns that are not mutually connected with smells could help reviewers and testers to focus their efforts on other parts of the code. On the other hand, potential co-occurrences of design patterns and bad smells could also help testers focus on parts that might have smelly design pattern classes. Studying the relation between DPs and code smells can help in understanding their associations with other parts of the code structure. Inspired by the study of Walter and Alkhaeir [24], we focus on the occurrence of code bad smells in code fragments that are part of design patterns, where more systems, design patterns, and code smells are analyzed. Also, we consider smell frequency factor in studying the potential co-occurrence. Moreover, we test the relationship between DPs and smells from the category level perspective. Lastly, we manually validate the significant cases that show potential relationships in order to gain more insights and draw more generalizable evidence about the code smells occurrence in DPs. The work presented in this paper has been extracted from a master thesis [25].

This study has four main contributions: 1) a confirmation study of previous works on the differences between classes involved in design patterns and classes that are not involved in design patterns in terms of smell proneness and smell frequency at class level, 2) an empirical evaluation of the differences in smell-proneness among classes that involve design patterns at DP category level, 3) an empirical evaluation of the differences in smell occurrences among classes that participate in a design pattern at motif level and 4) provide the dataset used in this study publicly for future replication.

The remainder of this paper is structured as follows. The related work is presented in Section 2. Section 3 describes the experiment setup. Section 4 highlights the experiment results while a discussion of the results is presented in Section 5 along with the limitations of the proposed work. Finally, Section 6 concludes the paper and recommends future research directions.

2. Related work

In this section, we review the related work with a special focus on the impact of DPs on software quality attributes, the impact of code smells on software quality attributes and the relationship between DPs and code smells.

2.1 The impact of design patterns on software quality attributes

Researchers investigated the impact of DPs on several quality attributes such as maintainability, change-proneness, performance and fault-proneness. Prechelt et al. [1] performed an experiment to investigate software maintenance that uses several design patterns and compared them with other scenarios with simpler alternatives. They found that, in many cases, design solutions in which GoF’s patterns are employed are easier to maintain than their corresponding simple solutions. Nonetheless, the authors also found that there are situations in which the use of design patterns made the maintenance of the program harder. They also confirmed that compared to a straightforward solution, design patterns may require additional maintenance tasks. Vokac et al. [26] replicated Prechelt et al.’s [1] study, finding that the Visitor pattern led to a high cost in terms of development time and poor accuracy. They also found that Decorator eases maintenance, despite the fact that it is hard to trace the control flow of the program and hence increases the effort required to understand it. Therefore, although its maintainability is good, it decreases understandability. Several studies [27–29] have been conducted to replicate the study in [1]. Some studies [27–29] found that DPs have a negative impact on maintainability while others (e.g. [30]) found no clear trend of any impact and recommended a deeper and a practical analysis be conducted. In addition, Garzas et al. [31] found that pattern-based designs make understandability and modifiability harder. On the other hand, Hegedus et al. [32] evaluated the impact of some DPs in JHotDraw systems on maintainability and found that DPs can affect maintainability positively and lead to improvements in the code.

Lutz Prechelt et al. [33] conducted a controlled experiment to investigate whether maintainer can perform better and fast if DPs in program code are explicitly documented using comments as compared when having only well-documented program without reference to DPs. They stated that patterns-based maintenance tasks were faster or with fewer problems as compared to the other way. Aversano et al. [34] performed a study to evaluate the relationship between DPs and change-proneness. The results show that DPs are more prone to changes under the condition that DPs play a significant role in system functionality. On the other hand, Gatrell et al. [35] found that some DPs participant classes are more prone to changes than non-participant classes. Bieman et al. [36] investigated the relationships between software change-proneness and design structure. Five open-source systems were used to conduct the experiment. Design patterns, size of a class and class inheritance participation were used to recognize design structure. The results of the study show that classes which participate in design patterns are more change-prone than other classes. This result may be justified by the fact that design patterns provide essential key responsibilities and functionality to the system. This may explain why classes that participate in design patterns are more change prone than other classes, relatively. Change-proneness in classes which participate in design patterns may be an indication of the co-occurrence of design patterns with either Divergent Change or Shotgun Surgery code smells, as defined by Fowler et al. [11]. Aversano et al. [34] concluded that design patterns appear to produce code which has a greater resistance to change. Moreover, they discovered that patterns are more suitable for applications which usually change more often.

The literature refers to some DPs which contribute to the presence of faults. Several studies discuss this issue [6, 37–39]. Vocak [6] studied the relationship between DPs and the number of faults using C++ systems. The results show different trends for different patterns. For example, Singleton and Observer were found to have more faults, due to their huge structural associations. In contrast, Factory Methods were subject to fewer faults because Factory Method classes are loosely coupled to the system structure. Additionally, no clear trend was found for the Template Method and Decorator patterns. The relationship between DPs and fault-proneness was investigated by Gatrell and Counsell [37]. The authors observed that classes participating in DPs are more prone to faults than classes not participating in DPs, especially for the Adaptor, Singleton and Template Method. Their study was conducted on ten Gang of Four (GoF) patterns using C# systems. Ampatzoglou et al. [38] evaluated the relationship between eleven DPs and fault frequency using Java systems. The results show both positive and negative relationship of the analyzed patterns to defect frequency. For example, the Adaptor pattern correlates positively with the Defect frequency, whereas Observer is negatively correlated with the Defect frequency.

From the previously mentioned studies, it can be observed that different studies reported different results for the impact of design patterns on software quality and the question about design patterns impact on software quality is still open due to such inconclusiveness.

2.2 The impact of code smells on software quality attributes

Code smells are perceived to lead to maintenance difficulties in software systems [13, 14, 40–42]. In addition, some studies claim that classes which have code smells are liable to be more change prone [40, 43, 44] and have more defects [40, 45–48] than classes that do not have code smells.

Olbrich et al. [40] studied the impact of God and Brain class smells on change size, change frequency and defects and found that classes with these code smells have more change size, change frequency and defects. The results of this study were further validated with more smells in [49, 50]. The studies [12, 43, 51] are in line with the findings of [40]. Interestingly, Olbrich et al. [40] found that when their results were normalized with respect to size i.e. Line of Code (LOC), the results no longer hold under the assumption that classes which are involved in code smells i.e. God Class and Brain Class have a ratio of functionality similar to other classes, on average. Thus, they concluded that code smells are not generally harmful and that code smells in classes may be an efficient way to organize code, providing that these smelly classes are constructed intentionally.

Abbes et al. [42] conducted an empirical study on the impact of anti-patterns i.e. Blob and Spaghetti Code, on program understandability. Their conclusion outlined that only one occurrence of Blob or one occurrence of Spaghetti does not significantly decrease program understandability. On the other hand, a combination of both Blob and Spaghetti affects program understandability and thus affects maintainability.

Jafaar et al. [45] studied the impact of design pattern classes that have dependencies with non-design pattern classes on defects and change-proneness. This study was conducted in the case of anti-pattern classes and non-anti-pattern classes. The results of their work show that classes which have dependencies on anti-patterns produce more defects those which have dependencies on design patterns. However, the results in [45] vary from one anti-pattern to another depending on the smell in addition to the analyzed software subjects. Khomh et al. [44] conducted an experimental study concerning the impact of anti-patterns on change-proneness and fault-proneness. They concluded that classes with anti-patterns are more change-prone and fault-prone. In addition, they found that structural changes affect classes participating in anti-patterns more than other classes.

Research community dedicated good efforts for understanding the code smells taking place in production code [52]. For instance, Tufano et al. [52] conducted an empirical study on a large set of open source projects (i.e. 200 projects). The objective of their study is to understand when and why code starts to smell. Therefore, the study was conducted over change history. Their results contradict common previous studies, where most of smells in the projects are initiated when the artifacts are created and not when they evolve. In addition, 80% of smells remain in the projects and only 9% of removed instances happened for refactoring purposes.

Code smells are not always bad; it depends on the situation in which code smells occurred in the source code. Hence, considering more smells in code individually could lead to different insights. Therefore, it would be interesting in the context of this study to analyze individual code smells code. The literature shows that studies investigated code smells and their relations with change-proneness, defects and understandability of systems.

2.3 The relationship between design patterns and code smells

The work that is most related to our study is the study of Walter and Alkhaeir [24]. In their work, they conducted an empirical study to evaluate the relationship between DPs and code smells at the class level. The authors examined this relation using 10 GoF’s design patterns and 7 code smells. They used two open-source projects, JfreeChart and Maven, with many subsequent releases of each project. The results indicate that the presence of DPs is not strongly associated with code smells, i.e., classes that participate in design patterns most likely are not smelly classes. However, these observations are more supported for certain patterns, such as Singleton and less supported for others, such as Composite. No clear trend was found regarding the evolution of smells over the analyzed pattern-based systems. The data provided by the study is limited to the class level and does not cover the category and role level of DPs. This is due to the limitation of the design patterns detection tool used in their study [53]. Generally, their findings designate that design pattern classes are connected to lower number of smells. Inspired by their study and supported by the fact that different design patterns may be associated with different code smells, in this study, we consider more systems, design patterns, and code smells. Furthermore, we consider smell frequency factor in studying the potential co-occurrence between design patterns and code smells. Moreover, we test the relationship between DPs and smells from the category level perspective. Lastly, we manually validate the significant cases that show potential relationships in order to gain more insights and to identify if a specific code smells or group of code smells is associated with a specific design pattern or group of design patterns. Unlike Walter and Alkhaeir study, we used a public manually validated design patterns dataset in order to increase the reliability of the results. Although our overall results show close findings to their study [24], we observed that some patterns are associated with certain smells. For instance, The Command pattern is found to be associated to the Blob, External Duplication and God Class smells. We conclude that there is a relationship between the existence of specific design patterns and bad smells in specific scenarios.

Codabux et al. [54] conducted an empirical study on the relation between code smells with class-level (micro pattern) and method-level (nano-pattern) traceable code patterns. They found that Immutable and Sink micro patterns are more frequent in classes having code smells. Cardoso and Figueiredo [55] performed an exploratory study to identify instances of co-occurrences of design patterns and bad smells using five systems. They found that the co-occurrences of Command with God Class and Template Method with Duplicated Code. Sousa et al. [56] conducted a case study with five Java systems to investigate the cooccurrence of design patterns and bad smells using software metrics. They focused on investigating if the use of design pattern reduces bad smell occurrence. They found that the application of design pattern not necessarily avoid bad smell occurrences. Sousa et al. [57] conducted a systematic literature mapping study on the relationship between design patterns and bad smells. They identified 16 primary studies and classified them into three approaches 1) impact on software quality, 2) refactoring 3) co-occurrence.

Khomh [21] proposed a quality model that takes into consideration several styles of design, including design patterns, code smells and anti-patterns. The author stated that the strength of the design structure is important in measuring the quality characteristics. Therefore, he analyzed how DPs and code smells can impact the quality characteristics, fault- and change- proneness. The results show that the model has a better and more accurate evaluation over the traditional metrics-based model.

Some works studied the refactoring techniques and tools that target improving software quality [58, 59]. Seng et al. [58] discussed the importance of maintenance that helps to eliminate code smells. Thus, they proposed an automa tic search-based approach to suggest the possible refactoring segments based on design patterns in the code of systems. The approach tends to improve the structure of software design without changing its behaviors. The approach was validated using only one open-source system called JhotDraw. Alshayeb [59] investigated the impact of software quality on refactoring to design patterns and found that there are no clear trends about refactoring or refactoring to patterns in quality improvements.

In addition to confirming the findings of prior studies, this study is different from existing studies in different ways. In particular, unlike the work of Walter and Alkhaeir [24], we address both smell-proneness and smell frequency whilst their work only considers smell-proneness. Also, we analyzed 20 design patterns and 13 code smells whilst their work examined 9 design patterns and 7 code smells. Furthermore, our work also targets different design pattern levels: class level, category level, motif level and role level (source-code based technique). Furthermore, for validation purposes, we use 10 different Java open source systems with the validated design pattern data. Lastly, we manually using a validation technique confirm a case of design patterns and code smells co-occurrence.

3. Empirical study setup

The main objective of this study is to explore if there is a relationship between design patterns and code smells. We define the following sub-objectives:

To empirically evaluate code smell-proneness and smell frequency in design pattern classes versus non-design pattern classes at the class level.
To empirically evaluate code smell-proneness in design pattern classes versus non-design pattern classes in design patterns categories (creational, structural and behavioral).
To empirically evaluate code smell-proneness for individual design motifs. This leads to identify roles of significant design patterns, if any.

The next sections present the details of the empirical study including: the research questions, the analyzed design patterns and code smells, data collection, research methodology and measurement tests.

3.1 Research questions

To achieve the objective of this study, we evaluate the impact of design patterns on code smells at the following different granularity levels:

Class Level: this level empirically evaluates the differences in terms of smell-proneness and smell frequency between classes involved in design patterns and classes that are not involved in design patterns i.e. (Smelly Design Pattern Classes (SDP) versus Smelly Non-Design Pattern Classes (SnDP)).
Category Level: this level empirically evaluates the differences in smell-proneness among classes that involve different categories of design patterns.
Motif Level: this level empirically evaluates the differences in number of smells occur among classes that involve a specific single design motif individually. In addition, in this level we identify the roles of significant design motifs, if any.

In relation to the aforementioned levels, we formulate the following research questions:

RQ1: Are design pattern classes more smell-prone and more frequent than non-design pattern classes?

RQ2: Do code smells have significant differences in terms of proneness when they are present in the different categories of design pattern classes?

RQ3: Are the participant classes in a specific individual design motif more smell-prone for a specific smell?

3.2 Research methodology

To achieve the research objective, we follow the methodology detailed in Fig 1. The methodology phases are: (1) the selection of subject systems and detection tools (2) the execution of the tools to obtain the related results of design patterns and code smell data and (3) data analysis and data mining.

In the first phase, we selected the systems to be used for analysis based on the P-Mart repository [60]. In addition, we selected the design patterns and code smell tools. The inFusion tool [61] was selected for code smell detection while the P-Mart repository was used for design pattern data. We kept the default values of the configuration parameters for code smell detection tool for possible replication.

In the second phase, we collected code smells and design pattern data of each system. For the design patterns, we used the available XML files in the P-Mart repository for each system and parsed them to get each design pattern class along with its corresponding design patterns. Hence, 10 files are resulted in this step. For code smells, we ran the tool over each system and collected the smelly classes along with the number of smells in each class with the type of smell. This step also resulted in 10 files for each system. Next, for each system, we matched both files of design patterns and code smells to be compared later. Therefore, we created one file per project to store the output results of the design patterns and their corresponding code smells. This makes the needed information of design patterns and code smells available for each class in each file in the systems. Each file has 25 columns; the first column is the class name followed by the occurrence of the corresponding design pattern in that class, if any. Column number three indicates the code smell occurrence (0 or 1) followed by the column showing the number of smells in that class. The fifth column specifies the type of smells the class may have. The remaining 20 columns refer to the corresponding design patterns types classified based on the categories of design patterns (i.e., Creational, Structural, and Behavior).

In the last phase, we performed statistical data analysis and data mining using the data in the stored files to analyze the results. Section 3.3 and Section 3.4 discuss more details of the data collection and statistical tests used in this study. Next, we explain the data collected in this study.

3.3 The used design patterns and code smells

In this work, we used 20 design patterns out of the 23 proposed by Gamma et al. that are available in the P-Mart repository [60]. Table 1 lists the design patterns used in this study.

Table 1. The used design patterns.

Name	Category
Abstract Factory, Builder, Factory Method, Prototype, Singleton	Creational
Adapter, Bridge, Composite, Facade, Decorator, Proxy	Structural
Command, Iterator, Mediator, Memento, Observer, State, Strategy, Template Method, Visitor	Behavioral

Name	Acronym
Data Class	DC
Data Clumps	DCl
Refused Parent Bequest Class	RPB
Schizophrenic Class	SC
Blob Methods	BL
Intensive Coupling	IC
Sibling Duplication	SD
Internal Duplication	ID
External Duplication	ED
God Class	GC
Feature Envy	FE
Tradition Breaker	TB
Message Chains	MC

Tools	# of detectable design patterns	Precision%	Recall%
SPQR [65]	1	-	-
MARRPLE [66]	3	78.6	78.3
DP-Miner [67]	4	91–100	-
WOP [68]	4	57.3	54.5
DPRE [69]	6	62–97	-
DPJF [70]	8	100	80
Similarity Scoring [53]	11	100	95.9
DeMIMA[71]	13	34	100
Pinot [72]	17	-	-
PTIDEJ [73]	20	65	100
FUJABA [74]	All (GoF) patterns	-	-

Systems	Language	Release no.	# Classes	LOC
DrJava	Java	v20020619	215	47,617
		v20020703	238	52,870
		v20020804	267	61,844
JHotDraw	Java	v5.1	155	16,085
MapperXML	Java	v1.9.7	217	32,667
Nutch	Java	v0.4	165	37,106
PMD	Java	v1.8	446	52,302
Junit	Java	v3.7	78	6,517
QuickUML	Java	v2001	156	23,319
Lexi	Java	v0.1.1	24	10,005
Total # (All Systems)			1961	340,332

Systems	# Classes	# & Percentage of DP Classes	# of Smelly DP Classes(SDP)	# of Smelly non-DP Classes(SnDP)	# &Percentage of Smelly Classes	# of Non-Smelly DP Classes(nSDP)	#of Non-Smelly non-DP Classes(nSnDP)	# & Percentage of Non-Smelly Classes
DrJava v20020804	267	85 (31.8%)	9	25	34 (12.7%)	76	157	233(87.3%)
JHotDraw v5.1	155	103 (66.5%)	5	3	8 (5.5%)	98	49	147(94.8%)
DrJava v20020619	215	41 (19.1%)	5	20	25 (11.6%)	36	154	190(88.4%)
DrJava v20020703	238	55 (23.1%)	5	23	28 (11.8%)	50	160	210(88.2%)
MapperXML v1.9.7	217	48 (22.1%)	6	21	27 (12.4%)	42	148	190(87.6%)
Nutch v0.4	165	41 (24.8%)	13	26	39 (23.6%)	28	98	126(76.4%)
PMD v1.8	446	43 (9.6%)	6	29	35 (7.8%)	37	374	411(92.2%)
JUnit v3.7	78	52 (67.7%)	2	2	4 (5.1%)	50	24	74(94.9%)
QuickUML 2001	156	41 (26.3%)	1	6	7 (4.5%)	40	109	149(95.5%)
Lexi v0.1.1 alpha	24	7 (29.2%)	2	10	12 (50%)	5	7	12(50%)
Total # (All Systems)	1961	516(26.3%)	54	165	219(11.2%)	462	1280	1742(88.8%)

Category	Patterns	drjava-20020804	JHotDraw v5.1	drjava-20020619	drjava-20020703	MapperXML v1.9.7	Nutch v0.4	PMD v1.8	JUnit v3.7	QuickUML	Lexi v0.1.1
Creational Patterns	Abs Factory					1				1
	Builder				1			2		1	1
	Factory Method	1	3		1	1		3
	Prototype		2
	Singleton	8	2	8		3	1		2	1	2
Structural patterns	Adapter	2	1	2	1	2	2	1
	Bridge	1					2
	Composite		1			1		2	1	1
	Decorator		1						1
	Façade					1
	Flyweight
	Proxy	1		1	1			1
Behavioral patterns	Chain of Responsibility
	Command	2	1	2			2			1
	Interpreter
	Iterator	1		1			1	1	1
	Mediator	1
	Memento	1		1			2
	Observer		2			1		2	3	1	2
	State	3	2		3
	Strategy	3	4	1	1	1	2
	Template	9	2		8	4	3	1
	Visitor	1		1				1
SUM		34	21	17	16	15	15	14	8	6	5

Research Objective	Test Type	Test Objective
Objective 1: Empirical evaluation of code smell-proneness and smell frequency in design pattern classes versus non-design pattern classes at class level	Wilcoxon signed-rank [80]	Compare the significance of the overall data at class level
	Odd Ratio (OR) [81]	Compare the significance of the overall data at class level
Objective 2: Empirical evaluation of code smell-proneness in design pattern classes versus non-design pattern classes in design patterns categories (creational, structural and behavioral).	Kruskal Wallis [82]	Measure multiple groups (categories) of data
	Mann-Whitney U [84]	Compare the pairs of categories.
Objective 3: Empirical evaluation of code smell-proneness for the individual design motifs.	Kruskal Wallis test [82]	Compare the design patterns in each category
	Apriori algorithm [83]	Test the association rules analysis
	Manual source code-based analysis	To validate significant rules, if any and identify roles participating in the pattern

Systems	Odd Ratio Value	Smell Event
QuickUML	.454	<1
Lexi	.280	<1
JUnit	.480	<1
JHotDraw	.833	<1
MapperXML	1.007	≈ 1
Nutch	1.750	>1
PMD	2.091	>1
DrJava2002619	1.069	≈ 1
DrJava2002703	.696	<1
DrJava2002804	.744	<1
All systems	.907	<1

Tested Variable	Group	Kolmogorov-Smirnov (Significance)	Shapiro-Wilk (Significance)
Smell-Proneness	Participant	<0.001	<0.001
Smell-Proneness	Non-Participant	<0.001	<0.001
Smell-Frequency	Participant	<0.001	<0.001
Smell-Frequency	Non-Participant	<0.001	<0.001

Metric	SDP/S	SnDP/S
Mean	0.297	0.703
Median	0.238	0.762
Std. Dev.	0.190	0.190
Variance	0.036	0.036

Systems	p-value
QuickUML	0.417
JHotDraw	0.872
MapperXML	0.333
PMD	0.338
DrJava2002619	0.081
DrJava2002804	0.986
All Systems	0.987

System/Pair	Creational vs Structural	Creational vs Behavioral	Structural vs Behavioral
QuickUML	0.386	0.317	1.000
JHotDraw	0.657	0.967	0.612
MapperXML	0.552	0.139	0.372
PMD	0.193	0.961	0.221
DrJava2002619	1.000	0.146	0.078
DrJava2002804	0.871	0.894	0.929
All Systems	0.907	0.976	0.874

Rule no.	Rules
R1	Command ⇒Blob
R2	Command ⇒GodClass
R3	Command ⇒ExternalDuplication
R4	Memento ⇒Blob
R5	Memento ⇒ExternalDuplication

Rule no.	Conviction Values
R1	1.16
R2	1.29
R3	1.11
R4	1.97
R5	1.61

DP\CS	DC	DCL	RPB	SC	BL	IC	SD	ID	ED	GC	FE	TB	MC	SUM^*	TOTAL⁺
Abs. Factory	2	-	-	1	1	-	-	-	-	-	-	1	-	5	18
Builder	-	1	1	-	-	-	2	-	-	-	1	-	-	5	25
Factory Method	2	1	-	3	4	1	-	1	2	1	1	1	-	17	95
Prototype	-	-	-	-	-	-	-	-	-	-	-	-	-	-	21
Singleton	-	-	-	-	1	-	-	-	-	-	-	-	-	1	27
Adapter	-	-	-	1	2	-	-	-	-	-	-	1	-	4	58
Bridge	-	-	-	-	-	-	-	-	-	-	-	-	-	-	28
Composite	2	-	1	2	-	1	-	1	3	-	1	-	-	11	102
Facade	1	-	-	-	-	-	-	-	-	-	-	-	-	1	3
Decorator	-	-	-	-	-	-	-	-	-	-	-	-	-	-	57
Proxy	-	-	-	1	-	1	-	1	2	-	1	-	-	6	8
Command	-	-	-	-	10	1	-	-	8	6	-	-	1	26	58
Iterator	-	-	-	-	3	1	2	-	-	2	-	-	-	8	17
Mediator	-	-	-	-	-	-	-	-	-	-	-	-	-	-	3
Memento	2	-	-	-	10	-	-	-	8	1	-	-	-	21	18
Observer	2	-	-	3	1	-	1	-	-	-	-	-	-	7	87
State	-	-	-	-	-	-	-	-	-	-	1	1	-	2	66
Strategy	-	-	-	1	-	1	2	-	-	-	1	-	-	5	90
Template Method	2	-	2	1	4	1	-	2	4	4	1	1	-	22	133
Visitor	-	-	-	-	-	-	-	-	-	1	-	-	-	1	7

Test Objective	Test Type	Test Results
Find the difference in smell-proneness among the overall design patterns in each category	Kruskal Wallis test	All categories have significant p-value at a 95% confidence level in the subject systems.
Explore the co-occurrence of each design pattern-code smell pairs	Learning association rules in the Apriori algorithm & source code-based analysis	Some rules show that there is a positive association between the presence of specific design patterns and code smells, as shown in Table 16. Moreover. We improved our analysis based on source code validation technique.

Research Question	Question Answer
RQ1: Are design pattern classes more smell-prone and more frequent than non-design pattern classes?	Classes participating in design patterns have less smell-proneness and smell-frequency than classes not participating in design patterns in the subject systems.
RQ2: Do code smells have significant differences in terms of proneness when they are present in the different categories of design pattern classes?	Every design pattern category act in the same way in terms of smell-proneness in the subject systems. Yet, some patterns appear to be more smell prone than others. Therefore, we propose RQ3.
RQ3: Are the participant classes in a specific individual design motif more smell prone for a specific smell?	A weak relation between the presence of most design patterns and the absence of most code smells. However, we observed that some patterns are associated with certain smells. The most noteworthy cases are the Command pattern with Blob and God Class smells. In addition, the Memento pattern was discovered to be connected to Blob and External Duplication smells.

PERMALINK

Empirical study of the relationship between design patterns and code smells

Mahmoud Alfadel

Khalid Aljasser

Mohammad Alshayeb

Roles

Abstract

1. Introduction

2. Related work

2.1 The impact of design patterns on software quality attributes

2.2 The impact of code smells on software quality attributes

2.3 The relationship between design patterns and code smells

3. Empirical study setup

3.1 Research questions

3.2 Research methodology

Fig 1. Research methodology.

3.3 The used design patterns and code smells

Table 1. The used design patterns.

Table 2. The used code smells.

3.4 Data collection

3.4.1 Research data

Table 3. Design patterns detection tools.

Table 4. Descriptive statistics on the analyzed systems.

3.4.2 Code smell detection

3.4.3 Analysis of granularity at design patterns and code smells detection

3.4.4 Design patterns and code smell data

Table 5. Statistics on smelly and design pattern classes.

Table 6. Design pattern instances in the subject systems.

Fig 2. Statistics of participant to non-participant classes.

3.5 Statistical tests

Table 7. Statistical tests used to achieve each objective.

4. Empirical study results

4.1 Smell-proneness and smell frequency evaluation at class level

4.1.1 Smell-proneness evaluation

Table 8. The odd ratio test analysis for smell-proneness evaluation of participant vs. non-participant design pattern classes groups.

Fig 3. Smell-proneness comparison of participant vs. non-participant design pattern classes in all systems.

Table 9. Test of normality.

Table 10. Statistics of smell-proneness in all systems.

Fig 4. The respective values of smell-proneness in SDP and SnDP.

4.1.2 Smell frequency evaluation

Fig 5. The respective values of smell frequency in SDP and SnDP.

Table 11. Statistics on smell frequency in all systems.

4.2 Smell-proneness evaluation at the category level

Table 12. P-values of evaluation design pattern categories using the Kruskal-Wallis test.

Fig 6. Smell-Proneness comparison of design pattern categories.

Table 13. P-values of the evaluation design pattern categories using the Mann Whitney test.

4.3 The impact of design patterns at the motif level

4.3.1 Evaluating the differences in smell-proneness among design pattern categories

Fig 7. Comparison of smell-proneness in the creational category.

Fig 8. Comparison of smell-proneness in the structural category.

Fig 9. Comparison of smell-proneness in the behavioral category.

Table 14. P-Value for the Kruskal Wallis test for each design pattern category.

4.3.2 Applying association rules for design patterns and code smell pairs

Table 15. Data of individual patterns with individual smells.

Table 16. List of significant association rules.

Table 17. The Conviction values of the identified rules.

4.3.3 Source code-based validation technique at role level

Table 18. Summary of results at design motif level.

5. Discussion

5.1 Co-occurrence between design patterns and code smells

Table 19. Summary of RQs and their answers.

5.2 Practical implication

5.2.1 Developers

5.2.2 Researchers

5.2.3 Tool builders

5.3Threats to validity

6. Conclusion and future work

Acknowledgments

Data Availability

Funding Statement

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases