Introduction
To produce the quality projects with low cost, the software organizations are motivated in transforming their development activities from collocated to global software development paradigm. Dhar [1] underlined that the software outsourcing includes allocation or transformation of development activities, management of development process, decision of management and services across the geographical boarder. With the aim to improve the development activities, various development approaches and platforms were developed. Therefore, the cloud computing (CC) is the most recent environment that assist the offshore software development paradigm.
The software organizations widely consider the cloud based global software development (CGSD) paradigm to perform their development tasks beyond the geographical and cultural boarder. The CGSD paradigm provides the dynamic, scalability and availability of distributed resources [2]. The CC provides the services like software as a service (SaaS), platform as a service (PaaS) and infrastructure as a service (IaaS) that motivates majority of software organizations to deploy the cloud services for CGSD. Though, the CGSD offers the software development organizations to share and access the geographically distributed IT resources and applications [3], [4]. CGSD reformed the business of software engineering industry.
The CGSD paradigm, the organizations working as clients transformed the development activities to vendor organizations for the development of quality projects within time limits and budgets [5]. Niazi et al. [6] mention that “the software organizations of developed countries outsource their development activities to the organizations of developing countries as the development cost is one third less in developing countries as compare to developed countries”. Moreover, Ramasubbu [7] mention that CGSD paradigm assists to hire the skilled labor with low cost from developing countries. Furthermore, Niazi et al. [8] argued that CGSD paradigm assists to reduce the development time by arranging the development activities around the globe with respect to the time zone differences.
Besides the significant gains, the management of development activities across the geographical distributed environment is complicated as it causes issues like communication, coordination and control [1], [9]. The geographical and cultural differences between the clients and vendor organizations causes serious problems e.g. time zone differences, lack of frequent and effective communication, and lack of trust and confidence [10]–[12]. Kaiser et al. [13] highlighted that the hidden cost causes the budget overrun due to which organization leads towards project failure.
Despite the CGSD importance in software industry, there is limited research available to resolve the problems of CGSD practitioners in global environment. Though, considering the importance of CGSD in software industry, we motivated to empirical explore the best practices that could assists the real-world practitioners for the successful adoption of CGSD paradigm. The study objectives include: (1) to identify the best practices of CGSD reported in the literature and in real-world practices; (2) prioritization of the investigated best practices concerning to their importance for CGSD paradigm. We are confident that the in-depth analysis of CGSD best practices will assist the academic researchers and industry experts to develop the new and effective techniques for the successful execution of CGSD activities. Though, to address the study aims, the developed research questions (RQ) are as follows:
[RQ1]:
What best practices of CGSD are reported in the literature and industry practices?
[RQ2]:
How to rank the investigated best practices with respect to their significance for CGSD paradigm?
[RQ3]:
What would be the prioritization-based taxonomy of investigated best practices?
Background and Motivation
The CC offer the CGSD firms to access the shared IT resources and applications [14]–[16].The most important CC services include: “on-demand self-service”, “virtualization”, “management of IT resources ”, “available over internet”, and “charged on a pay-per use basis” [4]. Moreover, CC offers SaaS, PaaS and IaaS and different types of cloud networks i.e. private, public, hybrid, community models [3]. The services of CC can be access remotely via internet which are managed and own by the service provider organization [3], [4]. The virtual availability of CC services provides the opportunity to software organizations to start their CGSD with low capital investment [17]. The public cloud is operated and management by the services provider (external body) to facilitate the general public via internet [17]. Besides, the private cloud is operated and won by an organization to assist their own distributed practitioners (sites) [3]. The private cloud is secure and there is very little threat towards the data security. In hybrid CC the organizations save their data on private cloud and public CC is used for other types of services [3], [4].
The business gains always the priority of every software firm and if the economic benefitsoffer along with additional opportunity like skilled human resources, low time, quality work andupdated technological tools etc. the organization consider such paradigm for log time [6], [11], [18]. Hence, the CGSD paradigm provides the environment for software organizations to conduct development activities across the globe aiming high quality product development reducing time and cost [19], [20]. In CGSD, the adjustment of development activities with respect to the time zone, significantly impact to reduce the development time [21]. Espino et al. [10] also highlighted that the CGSD provides the opportunity to keep in touch with global market quality and trend.
The global software development paradigm is adopted in software industry since last two decade, but the CGSD paradigm is still not mature enough. We found some studies conducted to highlight the problem of CGSD. Such as, a UK based study was conducted by Oza et al. [22] to address the relationship of CGSD practitioners. They conducted empirical study with Indian client organizations and the vendor organization of USA and European organizations. They reported that the good and cooperative relationship among client and vendor firms is critical for the successful execution of CGSD practices. We further identified the Nguyen et al. [23] study with the vendor organizations of Vietnam and client organizations of European and American countries. Similarly, Sabherwal [24] conducted an empirical study and reported the role of trust in CGSD environment. Raj-Kumar and Dawley [25] reported the critical risks of CGSD between Indian and US software organizations. The CGSD practitioners also faced various challenges that make the development activities more complicated. For example, Böhm et al. [2] and Chang and Gurbaxani [26] “reported the challenge “lack of frequent communication and coordination” between the CGSD practices. Dey et al. [27] argues that the activities of CGSD are more communication and” coordination oriented and there is physical meetings which cause issues like weak communication and coordination among team members
Besides the importance of CGSD in current era, limited studies are available that explore and fix the complexities of CGSD paradigm. Though, this research aims to explore the best practices of CGSD reported in the literature and in real-world practices; and prioritize them with respect to them critically for CGSD organizations. The study findings will deliver the prioritization-based taxonomy of the CGSD best practices that help industry practitioners to improve and develop the new approaches for CGSD.
Research Design
This study aims to identify and prioritize the best practices of CGSD paradigm. To answer the study RQs, we adopted following approaches.
Systematic literature review
Questionnaire survey approach
Fuzzy-AHP
All these research approaches are demographically shown in Figure 1, and describe in the following sections:
A. Systematic Literature Review (SLR)
An “SLR was considered to collect the most related literature according to the need study RQs. SLR is a systematic process to collect the most potential literature related with RQs. SLR provides the opportunity to explore, verify, analyses and synthesis the data from the existing literature [28]. The results of SLR study is less biased and thorough as compare to informal literature [29]. Various existing studies of other software engineering domain adopted the SLR approach [6], [8], [30], [31]. By following the guidelines of Kitchenham and Charters [29], we have performed the all the steps of SLR. The adopted SLR protocols are enlisted in Figure 1 and each step explained in the sub-sequent sections.
1) Planning the Review
a: Research Questions
The goal of SLR study is to investigate and analysis the best practices of CGSD reported in the literature. Therefore, the developed research question is RQ1 which is presented in section 1.
b: Data Collection Source
For the collection of most related data related to study objectives, the selection of most appropriate digital repositories is significant. Though, we have considered the instruction of Zhang et al. [32] and Chen et al. [33] and the following repositories were selected:
“IEEE Xplore (http://ieeexplore.ieee.org)”
“ACM Digital Library (http://dl.acm.org)”
“Springer Link (http://link.springer.com)”
“Wiley Inter-Science (http://www.wiley.com)”
“Science Direct (http://www.sciencedirect.com)”
“Google Scholar (http://scholar.google.com)”
“IET Software (https://digital-library.theiet.org)”
c: Search String
An appropriate search string plays an important role to explore the most related and potential literature related to the research questions of the study [32]. Therefore, we have used the guidelines of Quasi-Gold Standard (QGS) [34] to collect the key terms and their respective substitutes from the 5 studies i.e. [SS1, SS2, SS3, SS4, SS5]. The Boolean operator “AND” and “OR” were used to concatenate the collected key-terms (Table 1).
d: Initial Inclusion and Exclusion Criteria
We have developed the inclusion criteria’s to initially refine the selected studies considering the study RQs. To develop the criteria, we have used the instruction of Kitchenham and Charters [29] and the existing studies of Niazi et al. [6] and Inayat et al. [35].
The collected literature should be published as a conference paper, journal articles or book chapters.
The selected literature should elaborate the best practices of CGSD paradigm.
The findings of the selected literature should be based on empirical investigations.
The selected articles or book chapter should provide the detailed description of CGSD paradigm.
Moreover, we developed the exclusion criteria considering the existing studies of Niazi et al. [6] and Inayat et al. [35] and Akbar et al. [31].
The studies do not provide the detail discussion of CGSD best practices.
The content of the extracted data not in English language.
The article having less than 6 pages were not entertained.
If research article is from same research group or project, the most recent and completed version was entertained.
e: Study Quality Assessment (SQA)
Quality of the selected literature was assessed aiming to determine the worth of selected literature with respect to the study objective. To do this, the quality assessment criteria was developed using the instruction of Kitchemhm and Charctros [29] and by following the studies published in other domains of software engineering [5], [8], [36]. The list of quality assessment criteria’s checklist and the considered Likert scale is presented in Table 2.
2) Conducting the Review
a: Final Study Selection
To address the RQs of this study, the literature was selected via three different approaches. Firstly, we have considered the guidelines of QGS [34] and 5 papers were selected. Secondly, by carefully executing the search string (section 3.1.3) 520 papers were collected. The collected literature was further purify by applying the tollgate technique introduced by Afzal et al. [37]. By carefully applying tollgate approach steps, finally 54 were selected for data collection process. At third step, we have performed the backward and forward snowballing considering the cited references in the paper and the references in which a selected paper is cited. Using the snowballing technique, 47 papers were extracted and by applying the tollgate approach, finally 12 studies were considering from Phase-3 (snowballing, Figure 2). Though, as presented in Figure 2, Finally 71 studies were considering for data extraction process. All the selected studies were labeled as “SS” to indicate their use in this paper. The final list of SLR studies is presented in Appendix-A.
b: Data Extraction and Synthesis
The final selected studies were considered in data extract process. First three authors of this study participated in article reviewing and data extraction process. The data extraction team, continuously reviewed the data from the selected studies. Author no. 4 and 5 arbitrary involved and validate data extraction process.
Firstly, the themes, statements, concepts and ideas are captured and enlisted. The selected data were rephrased and finally formed the list of best practices of CGSD paradigm. The list of investigated best practices is given in Table5.
There might be a chance of researcher’s biasness in data extraction process. Though, we have conducted an inter-rater reliability test [37], to determine the researchers biasness. We have invited 3 external experts from empirical research lab. They selected 15 paper and conducted the data extraction process. Based on the data extraction results of research team and external experts, we have determined the “non-parametric Kendall’s coefficient of concordance (W)”; the value of W = 1 and W = 0 presents the complete agreement and disagreement, respectively. Hence, the determined results (W = 0.89, p = 0.004) presented that there is no biasness in the data extraction process. Though, we are confident that the extracted SLR results are consistent.
3) Reporting the Review
a: Quality of Selected Studies
In SLR, it is important to check the quality of selected studies with respect to the study RQs. To measure effectiveness of selected literature, the developed criteria presented in section 3.1.6 were used. The summarized results revolved that 80% of the selected studies score ≥60% and this indicates that the selected literature has the potential to answer the proposed RQ of this study. The detail quality assessment score is presented in Appendix-A.
b: Adopted Research Methods and the Publication Years of Selected Studies
In order to check the frequency of publication with respect to time duration on the CGSD context. It is noted that the selected studies are published between 2010 to 2020. This renders that the current era is the critical research period for cloud based global software development paradigm. The year-wise frequency of publication is presented in Figure 3.
Moreover, the adopted research approaches used in selected studies were also extracted (Figure 3). The frequency analysis shows that Questionnaire survey (QS) 18%, Case studies (CS) 27%, grounded theory (GT) 9%, Content analysis (CA) 11%, Action research (AR) 10% and Mixed methods were adopted by 25% of the selected studies. The analysis shows that case studies and the mixed methods are the most widely adopted research method in SLR studies.
B. Questionnaire Survey Study
The questionnaire survey study in an effective approach to collect the data from dispersed and potential population.
Kitchenham and Pfleeger [38] highlighted that the selection of data extraction process is based on nature of data “available data collection resources”, “controlling mechanism of selected approach” and “skill to operate the variable of interest”. It is hard to collect the representative set of data using observational data collection process [30], [31]. Though, we have used the questionnaire survey approach to collect the data from the experts of CGSD working across the globe at different geographical locations.
1) Questionnaire Development
To get the perceptions of industry practitioners concerning to the collected best practices from the literature, an online questionnaire was developed. The developed questionnaire was hosted via Google form (docs.google.com/forms). The questionnaire survey is an effective way to collect and find the potential population working across the border. The key objective of questionnaire survey approach is to verify the best practices identified via SLR study. Though, the developed questionnaire was based on three core parts. The first part consists of the questions related to bibliographic information of survey respondents. The second part contain the list of best practices and their core categories. The third section is close-ended and an in which we request the participants to add additional best practices that are not enlisted in second section.
2) Pilot Assessment of the Questionnaire Survey
The survey instrument was developed based on the previous experiences of study authors. The developed questionnaire was further verified with experts to check the correctness and understandability of the questionnaire [12], [36], [39]. In pilot assessment procedure, three experts were invited; form with one is from City University Hong-Kong and rest of both are from industry i.e. AMAZON India and QSoft-Vietnam. The participants give some important suggestion to improve the understandability and readability of the questionnaire. Firstly, they suggest to present the queries in table form. They also suggest some questions for collection of bibliographic information. All the points highlighted by the experts were entertained and the questionnaire was updated. The updated questionnaire (Appendix-B) was used for data collection process.
3) Ethics Approval
The ethical approval was obtained from research advisor committee of “Department of Computer Science, Nanjing University of Aeronautics and Astronautics-Nanjing-China”. Once the permission is granted, we have stated the data collection process by sending the online link of questionnaire survey to the targeted population.
4) Data Sources
The aim of empirical investigations is to verify the best practices identified using SLR. To address the RQs of this study, we adopted the snowballing technique to approach the potential population. The snowballing is an effective approach to collect the data from dispersed population, and it is also cost effective approach [36], [40], [41]. The snowballing is carried using the Email, LinkedIn and Research-Gate. The data collection process was executed between November-2019- to March-2020. During data collection process 98 response were collected and the collected responses were manually checked to found the completeness of identified responses. During manual checking we noted 12 incomplete responses. Though, based on the suggestions of our research team, only the complete responses were used for further analysis.
C. Phase 3: Fuzzy Set Theory and AHP
Over the decades the AHP is considered as an effective approach to address the MCDM problems. AHP is a traditional MCDM techniques that consider the judgment of industry experts as it is and use the crisp number that leading to insensitivity of the uncertainty that came from linguistic variable [46]. In AHP method, it is prerequisites to check the expert’s opinions with respect to harmony, proficiency and carefulness. If the above stated criteria are satisfied then AHP is considered to be an efficient approach for addressing the MCDM problems [47]. Though, the AHP approach does not consider the vagueness of once judgment. Therefore, the vagueness and fuzziness exists in several MCDM problems may cause to imprecise the judgment of a decision maker in AHP method [48]. Hence, the fuzzy-AHP technique consider values between 0-to-1 to address the vague statements.
In fuzzy-AHP, the theme of fuzzy logic was used to address the vague data and gives a systematic roadmap for controlling the ambiguous and undefined situations [49]. Thus, the experts can specify their opinions in natural language expression with respect to the criticality of each criteria [50].
The questionnaire was used to collect the experts opinions working in GSD. The collected data contains the experts judgements with respect the best practices of CGSD projects. Therefore, we apply the fuzzy-AHP approach to fix the uncertainties and vagueness in the expert’s opinions with respect to the best practices of CGSD paradigm.
1) Fuzzy Set Theory
Zadeh et al. [51] introduced a of fuzzy set theory which is an extended version of traditional fuzzy set theory. The updated fuzzy set theory is more effective and oriented to manage the vagueness and uncertainties exist in industry practices. The membership function
2) Fuzzy Analytical Hierarchy (Fuzzy-AHP) Process
The fuzzy-AHP is one of the most powerful approach adopted to address the MCDM problems. The key advantage of fuzzy-AHP approach are the relative ease which assist to manage the multicriteria, easy to understand, and it can efficiently address both quantitative and qualitative data.
Definition:
“A triangular fuzzy number (TFN) F is denoted by a set (fl, fm, fu) as presented in Figure 4. The Eq.1 presents the membership function \begin{align*} \mu _{F} (x)=\left \{{{{\begin{array}{cc} {\dfrac {x-f^{l}}{f^{m}-f^{l}},} & {f^{l}\le x\le f^{m}} \\ {\dfrac {f^{u}-x}{f^{u}-f^{m}},} & {f^{m}\le x\le f^{u}} \\ {0,} & {Otherwise} \\ \end{array}}} }\right \}\tag{1}\end{align*}
The algebraic operational laws using two TFNs, namely (
Following steps are required to perform the fuzzy-AHP.
Step1:
Hierarchy development of the key problem (Figure 5).
Step2:
Determination of priority weigh using the pair-wise comparison matrix.
Step3:
Perform the consistency check
Step4:
Calculation of final ranking.
Though the traditional AHP has several benefits and complexities at the same time. The most critical disadvantage of AHP is its limitation with respect to the usability in Crisp situations, the unbalanced judgment scale, uncertainty and the judgment selection is subjective.
Therefore, the extended version of AHP is fuzzy-AHP was developed to address the MCDM problems more effectively [58]. The fuzzy-AHP has the capacity to address the vagueness and uncertainties in the expert’s opinions more effectively [50], [59]–[61]. We have adopted the fuzzy-AHP approach proposed by Chang [62], to rank the investigated set of best practices.
In prioritization problem, let X = {\begin{align*}&F^{1}_{gi},F^{2}_{gi},\ldots,F^{m}_{gi},\tag{2}\\&i= \textrm {}1,2,\ldots,n\tag{3}\end{align*}
And
The main steps of Chang’s extent analysis approach is as follows [62]:
Step 1: The fuzzy synthetic value concerning to \begin{equation*} S_{i} =\sum \limits _{j=1}^{m} {F^{j}_{gi}} \otimes \left [{ {\sum \limits _{i=1}^{n} {\sum \limits _{j=1}^{m} {F^{j}_{gi}}}} }\right]^{-1}\tag{4}\end{equation*}
To accomplish the expression \begin{equation*} \sum \limits _{j=1}^{m} {F^{j}_{gi}} =\left({\sum \limits _{j=1}^{m} {f^{l}_{gi},\sum \limits _{j=1}^{m} {f^{m}_{gi}},\sum \limits _{j=1}^{m} {f^{u}_{gi}}} }\right)\tag{5}\end{equation*}
\begin{equation*} \sum \limits _{i=1}^{n} {\sum \limits _{j=1}^{m} {F^{j}_{gi}}} =\left({\sum \limits _{i=1}^{n} {f^{l}_{i},\sum \limits _{i=1}^{n} {f^{m}_{i}},\sum \limits _{i=1}^{n} {f^{u}_{i}}} }\right)\tag{6}\end{equation*}
\begin{equation*} \left [{ {\sum \limits _{i=1}^{n} {\sum \limits _{j=1}^{m} {F^{j}_{gi}}}} }\right]^{-1}=\left({\frac {1}{\sum \limits _{i=1}^{n} {f^{l}_{i}} },\frac {1}{\sum \limits _{i=1}^{n} {f^{m}_{i}}},\frac {1}{\sum \limits _{i=1}^{n} {f^{u}_{i}}}}\right)\tag{7}\end{equation*}
Step 2: AsFa and \begin{equation*} V (F_{\mathrm {a}}\ge F_{\mathrm {b}}) = {\textit{sup}}[{\textit{min}}({\it \mu } _{F\mathrm {a}} (x), ({\it \mu } _{F\mathrm {b}} (x))]\tag{8}\end{equation*}
Equation 8 can also be similarly specified as below:\begin{align*} V\left ({{F_{a} \ge F_{b}} }\right)=&hgt(F_{a} \cap F_{b})=\mu _{F_{a}} (d) \\[-2pt]=&\left \{{{{\begin{array}{cc} 1 & {if\,f^{m}_{a} \ge f^{m}_{b}} \\ {\dfrac {f^{u}_{a} -f^{l}_{b}}{(f^{u}_{a} -f^{m}_{a})+(f^{m}_{b} -f^{l}_{b})}} & {f^{l}_{b} \le f^{u}_{a}} \\ 0 & {Otherwise} \\ \end{array}}} }\right \} \\ {}\tag{9}\end{align*}
Here, d indicates the ordinate of the highest intersection point between D,
Step 3: Calculation of overall degree of possibility of a convex fuzzy number and the other convex fuzzy numbers \begin{equation*} V(F\ge F_{1},F_{2},F_{3}\ldots.F_{k})=\min V(F\ge F_{i})\tag{10}\end{equation*}
\begin{equation*} d^{\prime }(F_{i})=\min V(F_{i} \ge F_{k})\tag{11}\end{equation*}
With the help of Eq. 12, calculate the weight vector using Eq. 11.\begin{equation*} W^{\prime }=\,(d^{\prime }(F_{1}),\,d^{\prime }(F_{2}),\,d^{\prime }(F_{3}),\ldots..d^{\prime }(F_{n}))\tag{12}\end{equation*}
Step 4: Determine the normalized weights vector using equation 13 and their outcome will be not fuzzy value which indicates the priority weight of each criteria:\begin{equation*} W=\,(d(F_{1}),\,d(F_{2}),\,d(F_{3}),\ldots..d(F_{n}))\tag{13}\end{equation*}
Step 5: Consistency check: The pairwise comparison matrix should be consistent in fuzzy-AHP analysis [64]. Therefore, the determination of consistency check of all the pairwise comparison matrix is necessary. To address this concern, we applied the graded mean integration for the defuzzification of matrixes. A triangular fuzzy number, denoted as P = (l, m, u), can be defuzzified to a crisp number as follows:”\begin{equation*} P_{crisp\,} =\,\frac {(4m+l+u)}{6}\tag{14}\end{equation*}
Once the matrixes are defuzzified, the consistency index could be determined easily by using the equation 15 and 16. If the determined value of consistency ration (CR) is less than 0.10, the matrixes are consistent, else the fresh opinions were required for pairwise comparison matrixes.\begin{align*} CI=&\frac {I_{\max } -n}{n-1}\tag{15}\\ CR=&\frac {CI}{RI}\tag{16}\end{align*}
, Indicate the largest eigenvalue of pair-wise comparison matrixes.\text{I}_{\mathrm {max}} n: Present the elements of each matrix.
RI: The standard values of RI are presented in Table 4.
Results and Analysis
This section consists of the result and discussion of study findings.
A. SLR Findings
The best practices of CGSD paradigm were identified by following the step by step procedure of SLR. During the SLR data extraction and synthesis process, a total of 30 best practices were identified that are particularly related with core knowledge areas of PMBOK [65]. The identified best practices present the guidelines to the successful execution of CGSD activities. The list of explored best practices and their respective categories are enlisted in Table 5.
B. Empirical Data Analysis
This study consists of the summarized results collected from the questionnaire survey study.
1) Bibliographic Data Analysis of Survey Practitioners
The bibliographic data was collected aiming to analyze the appropriateness of survey particiapents with respect to the domain andstudy objective. A summary of collected bibliographic data is discussed in this section and the detail information is given in Appendix –C.
2) Designation of Survey Participants
Finstad et al. [37] and Niazi et al. [30] emphasized that the position of experts mater a lot while collecting their opinions. They also underlined that an expert can give correct feedback if he has good experience on dealing the same types of daily maters. Though, using the bibliographic data, we have summarized the positions of the survey respondents (Figure 7). The results show that most of survey participates are software project manager and software developer. This renders that the collect data has the potential to address the research objective.
3) Participants Experience Based Analysis
The participant’s experienced based analysis is presented in Figure 8. The results shows that the participants experience ranges from 2 to 10 years. The calculated mean and median (6 and 5.5) illustrates that most of the participants renders young pool. Thus, the results show that there is a good mix of participants having different level of experience in software development process.”
4) Participant’s Organization Size-Based Analysis
The organization size of survey participants is presented in pie chart (Figure 9). The presented results indicated that from total population of participants 25% belongs to small scale organization, 50% works in medium size organization and 35% of participants are from large well-established organization.
5) Feedback of Industry Practitioners
A total of 30 best practices of CGSD paradigm were enlisted by conducting SLR study. The identified best practices and their core categories were further validated with the industry practitioners. To empirically verify the identified best practices and their core categories, an online survey questionnaire was developed using Google Form platform. The question of survey instrument is based on the list of best practices explored via SLR study. The survey responses were collected considering the Likert scale “i.e. five points scale” and the values of Likert scale are classified into core three categories i.e. positive (“strongly agree and agree”), negative (strongly disagree, disagree) and neutral. The positive category consists of the response who considered as the investigated best practices are related with CGSD paradigm in real-world industry. The negative category renders the responses of those participants who do not consider as the explored best practices for real world practices. The response of neutral category presents those participates who do not sure about the impact of investigated best practices on CGSD paradigm. The Table 6 shows the calculated results of survey participants.
The results of empirical study shows that survey participants agreed on the identified best practices and their respective categories are related with industry practices. All the enlisted best practices are scored ≥60% for all the reported best practices, hence this indicated the importance of best practices for the success and progression of CGSD paradigm. In addition, we also noted that the survey participants are agree with the categories of identified best practices. The results shows that C6 (Time = 91%) is the top ranked category among all the investigated best practices. C7 (Scope = 90%) and C3 (Communication = 88%) are the
C. Fuzzy-AHP Analysis
The fuzzy-AHP analysis approach has been applied aiming to prioritize the investigated set of best practice with respect to their criticality for the successful execution of CGSD paradigm. The experiment was conducted using “MATLAB R2016b”tool developed by American mathematician. The steps adopted to perfume the fuzzy-AHP analysis are briefly discussed in the sub-sequent sections:
Step-1 (Hierarchy Structure of Best Practices and Their Corresponding Categories): The hierarchy structure was developed to resolve issues of decision making by following step by step application of fuzzy-AHP (like Figure 5). Using the list of best practices and their respective categories, we have designed the hieratical structure (Figure 10).The Figure 10 shows the key objective of this study at level 1 while level 2 and 3 presents the core categories with their respective best practices.
Step-2 (Conducting the Pairwise Comparison): The basic aim of pairwise comparison is to determine the rank order of each best practices with respect to their importance for CGSD paradigm. To address this concern, we have performed the pairwise comparison with with the CGSD experts. Though, to accomplish this task we have designed the questionnaire and contact to the respondents of the first survey (section 3.2). After gathering, all responses from survey participants we have in total 31 responses. The responses were reviewed manually in detail by authors to check incomplete entries. During manual check, we found all the response complete and useable. The used questionnaire for fuzzy-AHP study is presented in Appendix D. The 31 responses of pairwise comparison survey might be not strong enough to generalize the results of fuzzy-AHP analysis. We noted that the FAHP analysis is a subjective approach, and the data collected from a small sample size is also acceptable [51]. Various existing studies used small data sets to [42], [53], [54] also used the small sample size for fuzzy-AHP analysis.
Furthermore, the collected responses for fuzzy-AHP analysis were further transformed into TRN number using the geometric mean. To transform the human judgments into TRN number, the geometric mean is an effective method. Thus, the used formula of geometric mean is given below:\begin{align*} \mathrm {Geometric mean}=&\text {n}\sqrt {\mathrm {t1 \times t2 \times t3\ldots \ldots \ldots.tn}} \\[-3pt] \text {t}=&\text {indicate the response score} \\[-3pt] \text {n}=&\text {Number of responses}\tag{17}\end{align*}
Step-3 (Determining the Local Priority Weight of Each Best Practice): The priority weight for all the best practices were determined to check the significance of each best practice within their respective category. Firstly, the synthetic extent values of four best practices of human resource management category using Equation 3. Furthermore, the priority weight of all the best practices were determined using Equation 4. An example of local priority weigh calculation of human resource management category best practices is presented below. Table 8 present the pairwise comparison of four best practices of human resources management category.
The synthesis values of human resource management category best practices (P1 to P4) were calculated using Equation 4 as follow
Using the Equation 6, the degree of possibility is calculated. The “minimum degree of possibility (priority weight) for each pair-wise comparison was calculated using” Equation 8.
Hence, the determined weights are: W’ = (1, 0.030119, 0.69846, 0.36415) (Table 9). By normalizing these values, the significance of attributes was determined as W = (0.4789, 0.01435, 0.3337). The given results reveal that P1(Organization management makes the CGSD process improvement practices as an important part of the development processes) is declared as the most important best practice in human resource management category compared with other three best practices.”
Step-4 (Test the Consistency of the Pair-Wise Matrix): This section contains all the steps required to determine the consistency of pair-wise comparison matrixes. To do this, the table of human resource management category best practices (Table 10) was considered. A triangular fuzzy number of the pair-wise comparison matrix of the best practices of human resource management are defuzzified to crisp number using Equation 14 and resulted the corresponding Fuzzy Crisp Matrix (FCM) as presented in Table 10:”
To determine the largest Eigen vector (\begin{equation*} \lambda _{\mathrm {max}}=\sum \left({\left[{\sum \text {Cj}}\right] \times \{\text {W}}\right)\tag{18}\end{equation*}
\begin{align*} \lambda _{\mathrm {max}} = & 2.7\ast 0.37938 + 7.0\ast 0.14945 + 3.7\ast 0.27593 \\& \qquad \qquad \qquad \qquad \quad + 5.2\ast 0.19524 = 4.1067 \end{align*}
\begin{align*} CI=&\frac {\lambda _{\max } -n}{n-1}=\frac {4.1067-4}{4-1}=0.035553 \\ CR=&\frac {CI}{RI}=\frac {0.035553}{0.9}=0.039503\end{align*}
Step-5 (Calculation of Global Weights): The objective of local weight (LW) calculation is to check determine the priority rank of a best practices within their respective category. For example, the human resource management category consists of 4 best practices (i.e. P1 to P4). The local weigh of each best practices of human resource management category were determined compared with the four best practices. On the other side, the global weight (GW) were determined compared with all the investigated 30 best practices of CGSD paradigm. The objective of global weigh determination is to check the priority order of each best practise for overall CGSD paradigm. The global weigh was determined by multiplying the local weigh of best practicewith the corresponding category weight. For example, best parctice global weight P1 = local weigh of
Step-6 (Ranking of Investigated Best Practices): The global weigh was considered to calculate the final ranks of each bet practices (Table 20). The final raking present the priority order of all the best practices concerning to their significance of CGSD paradigm. The results shows that P25 (Project planning done in order to estimate all the required resources) is declared as the highest priority best practice for the successful execution of CGSD paradigm. As in CGSD, the development is conducted at overseas sits across the globe; though the project planning is significant to estimate the resources required at overseas sites. The results revolved that P8 (Detail process improvement knowledge), P9(Frequent planning of interactions between distributed sits: daily stand-up/call improves this largely), P27 (Explicitly describe the benefits of CGSD to both team members and organization) and P5 (Management committed to support CGSD team members) are declared as the top five most important best practise that must be considered by the practitioner on priority while dealing with agenda of success and progression of cloud based global software development environment. CGSD paradigm. The ranking of all the investigated best practices are presented in Table 21.
Summary and Discussions
The objective of this work is explored and prioritize the best practices of CGSD paradigm. For the success and progression of CGSD projects, it is required to develop the new strategies and tools of CGSD activities. The exploration and prioritization of best practices will assist to focus on the key area of CGSD paradigm. The list of best practices, their categorizations and their prioritization, also provides the prioritization-based taxonomy of the best practices, which provides the body of knowledge to practitioner to develop the effective strategies for the success and progression of CGSD paradigm. The summarized description of study research questions is discussed below:
RQ1 (Identification of CGSD best practices)
A systematic literature study was conducted to explore the best practices of CGSD paradigm reported in the literature. By conducting the SLR study, a list of 30 best practices were identified that are important for the successful execution of CGSD paradigm.
Moreover, we used the core categories of PMBOK, and classified the identified best practices into eight key categories. In addition, to verify the best practise collected from the literature and their categorization process, we have performed questionnaire survey study with industry experts. The survey study results revolved that the identified best practices using SLR and the mapping process of identified best practices into PMBOK categories are useful for industry practices.
RQ2 (Prioritization of investigated best practices)
The fuzzy-AHP approach has been applied to rank the investigated set of best practices and their respective categories. To perform the fuzzy-AHP analysis, we have conducted fuzzy-AHP survey study with experts. Based on the finding of fuzzy AHP survey study, the pairwise comparison matrixes were developed. All the steps of fuzzy AHP approach were carefully applied and determine the local and global weights of identified CGSD best practices. Considering the calculated local weights, the best practices were locally ranked in their respective categories. The local ranking assists to determine the significance of best practices within their respective categories. Moreover, the global ranking was determined to check the significance of identified set of 30 best practices. The global ranking assists to determine the priority order of all the best practices compared with the set of identified set of 30 best practices. The according to the results presented in Table 20, P25 “(Project planning done in order to estimate all the required resources), P8(Detail process improvement knowledge), P9 (Frequent planning of interactions between distributed sits: daily stand-up/call improves this largely), P27 (Explicitly describe the benefits of CGSD to both team members and organization) and P5 (Management committed to support CGSD team members) are declared as the top five best practices for the successful execution of CGSD paradigm.”
RQ3 (Prioritization based taxonomy of best practise)
The prioritization-based taxonomy of the best practices was developed considering the local and global ranks obtain by applying the fuzzy AHP approach. The local ranks refer to the priority order of a best practice with in their respective category. Besides, the global ranks present the significance level of a specific best practice compared with all the identified best practices. Therefore, the prioritization-based taxonomy shows the variation in the ranking of best practices with respect to local and global ranking. For example, P3 (Conduct regular training session for team members) is ranked as 1st within the human resource management category and considering the global ranking, it standout at 7th most significant best practice for CGSD projects. We further noted that, P25 (Project planning done in order to estimate all the required resources) is standout at 1st with respect to both local and global rankings. This indicated that, P25 is the most critical best practice for ‘Scope’ category, and for overall study objective which is the prioritization of CGSD paradigm.
Furthermore, the significant variation in the local and global rankings of the best practices is also observed. For example, in communication category, P10 (Frequently visit the geographically distributed teams to decrease the communication gap) is ranked as 1st with respect to local ranking and 16th by considering the global ranking. In Time category P23 (A mechanism developed to avoid time pressure) is ranked as 1st and 13th by using the local and global rankings, respectively. This variation between local and global ranking help the practitioner to adopt the highest priority best practices, with respect to their working position and objectives. The developed prioritization-based taxonomy of the CGSD best practices provides the body of knowledge to industry experts and academic researchers to develop the new and effective plan and strategies for the success and progression of CGSD paradigm.
Threats to Validity
The data was extracted from the limited digital repositories and this might cause the missing of some related studies. Based on the other studies, this is not a systematic problem [35], [44], [66], [67].
Similarly, the extracted data from the selected studies might be not consistent and have uncertainties. We address this threat by conducting the inter-rater reliability test and the results shows that there is no researcher’s baseness and the extracted data is consistent.
An external threat towards the generalization of study results is the small sample size of empirical study. The data set consists (n = 86) might not strong enough to generalize the results of this study. Though, with reference to the other studies of software engineering domain [16], [45], [46], this sample size is representative of generalizing the study results.
Most of the survey participants were from developing countries (Asian countries); this may hinder to generalize the study results. Moreover, we also noted that a representative number of respondents are form developed continents (the USA or Australia), and this allows the generalization of results.
Conclusion and Future Directions
This study aims to explore the best practices of cloud based global software development reported in the literature. Though, conducting the systematic literature review a set of 30 best practices were investigated. The explored best practices were further categorized into core categories of PMBOK. With the aim to get the feedback and perception of industry practitioners, we have further conducted the questionnaire survey study. The results of questionnaire survey study revolved that, the investigated set of best practices and their mapping process is relevant with industry practitioners. In addition, we have adopted the fuzzy-AHP technique aiming to prioritize the identified best practices concerning to their criticality for CGSD paradigm. The prioritization based results revolved that ‘project planning done in order to estimate all the required resources, ‘detail process improvement knowledge’, “frequent planning of interactions between distributed sits: daily stand-up/call improves this largely”, “explicitly describe the benefits of CGSD to both team members and organization’ and ‘management committed to support CGSD team members” are the top five most important best practices for the success and progression of cloud based global software development paradigm. Based on the fuzzy-AHP results, and the core categories of best practices, we have developed the prioritization-based taxonomy of the investigated best practices, which assists the software practitioner and academic experts, to review and develop the new strategies for the successful execution of CGSD paradigm. We believe that the results and analysis of this study will serve as a knowledge base for the industry expert and researchers with respect to the best practices of CGSD paradigm and their significance for the successful execution of CGSD paradigm.
In future, we plan to identify the factors that could negative or positive impact on CGSD practices. We also plan to map the investigated best practise against each challenge and success factor, and this process will provide the guidelines for experts to the successful execution of CGSD paradigm in industry.
Appendixes
Appendixes
Appendix-A:
List of selected studies and quality assessment score (https://tinyurl.com/y2yzhkmx)
Appendix-B:
Sample of used survey instrument (https://tinyurl.com/y6sl5o6m)
Appendix-C:
Detailed bibliographic data of survey participants (https://tinyurl.com/y5qzuw6r)
Appendix-D:
Sample of fuzzy-AHP data collection instrument (https://tinyurl.com/y5zu9aau).