Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Mining Recency–Frequency–Monetary enriched insights into resources’ collaboration behavior from event data

Published: 01 November 2023 Publication History
  • Get Citation Alerts
  • Abstract

    Organizations increasingly rely on teamwork to achieve their goals. Therefore they continuously strive to improve their teams as their performance is interwoven with that of the organization. To implement beneficial changes, accurate insights into the working of the team are necessary. However, team leaders tend to have an understanding of the team’s collaboration that is subjective and seldom completely accurate. Recently there has been an increase in the adoption of digital support systems for collaborative work that capture objective data on how the work took place in reality. This creates the opportunity for data-driven extraction of insights into the collaboration behavior of a team. This data however, does not explicitly record the collaboration relationships, which many existing techniques expect as input. Therefore, these relationships first have to be discovered. Existing techniques that apply discovery are not generally applicable because their notion of collaboration is tailored to the application domain. Moreover, the information that these techniques extract from the data about the nature of the relationships is often limited to the network level. Therefore, this research proposes a generic algorithm that can discover collaboration relationships between resources from event data on any collaborative project. The algorithm adopts an established framework to provide insights into collaboration on a fine-grained level. To this end, three properties are calculated for both the resources and their collaboration relationships: a recency, frequency, and monetary value. The technique’s ability to provide valuable insights into the team structure and characteristics is empirically validated on two use cases.

    References

    [1]
    Abbasi A., Hossain L., Leydesdorff L., Betweenness centrality as a driver of preferential attachment in the evolution of research collaboration networks, J. Informetr. 6 (3) (2012) 403–412.
    [2]
    Agrawal, K., Aschauer, M., Thonhofer, T., Bala, S., Rogge-Solti, A., Tomsich, N., 2016. Resource Classification from Version Control System Logs. In: 2016 IEEE 20th International Enterprise Distributed Object Computing Workshop. EDOCW, pp. 1–10.
    [3]
    Ahmed, A., Batagelj, V., Fu, X., Hong, S.-h., Merrick, D., Mrvar, A., 2007. Visualisation and Analysis of the Internet Movie Database. In: 2007 6th International Asia-Pacific Symposium on Visualization. pp. 17–24.
    [4]
    Aljemabi M.A., Wang Z., Empirical study on the similarity and difference between VCS-DSN and BTS-DSN, in: Proceedings of the 2017 International Conference on Management Engineering, Software Engineering and Service Sciences, ICMSS ’17, Association for Computing Machinery, 2017, pp. 30–37.
    [5]
    Aysolmaz B., Nemeth M., Iren D., A method for objective performance benchmarking of teams with process mining and DEA, in: Proceedings of 29th European Conference on Information Systems, ECIS 2021, AIS Electronic Library, 2021, p. 144.
    [6]
    Bala S., Cabanillas C., Mendling J., Rogge-Solti A., Polleres A., Mining project-oriented business processes, in: Motahari-Nezhad H.R., Recker J., Weidlich M. (Eds.), Business Process Management, in: Lecture Notes in Computer Science, vol. 9253, BPM 2016, Springer International Publishing, 2015, pp. 425–440.
    [7]
    Bala S., Mendling J., Monitoring the software development process with process mining, in: Shishkov B. (Ed.), Business Modeling and Software Design, Springer International Publishing, 2018, pp. 432–442.
    [8]
    Bala S., Mendling J., Discovering activities in software development processes, in: Stirna J., Asensio E.S. (Eds.), The Practice of Enterprise Modeling 2020 Forum, CEUR WS, 2020, pp. 54–63.
    [9]
    Bala S., Revoredo K., de A.R. Gonçalves J.C., Baião F., Mendling J., Santoro F., Uncovering the hidden co-evolution in the work history of software projects, in: Carmona J., Engels G., Kumar A. (Eds.), Business Process Management, BPM 2017, in: Lecture Notes in Computer Science, vol. 10445, Springer International Publishing, 2017, pp. 164–180.
    [10]
    Balaban A.T., Klein D.J., Co-authorship, rational Erdős numbers, and resistance distances in graphs, Scientometrics 55 (1) (2002) 59–70.
    [11]
    Bastian, M., Heymann, S., Jacomy, M., 2009. Gephi : An Open Source Software for Exploring and Manipulating Networks. In: Proceedings of the International AAAI Conference on Web and Social Media. Vol. 3. No. 1. pp. 361–362.
    [12]
    Batagelj V., Mrvar A., Some analyses of Erdős collaboration graph, Social Networks 22 (2) (2000) 173–186.
    [13]
    Bauer A., The making of HoTT book, 2013, URL https://vimeo.com/68761218. (Accessed 30 September 2010).
    [14]
    Berendt B., Hotho A., Stumme G., Towards semantic web mining, in: Horrocks I., Hendler J. (Eds.), The Semantic Web — ISWC 2002, in: Lecture Notes in Computer Science, vol. 2342, Springer, Berlin, Heidelberg, 2002, pp. 264–278.
    [15]
    Bird C., Pattison D., D’Souza R., Filkov V., Devanbu P., Latent social structure in open source projects, in: Proceedings of the 16th ACM SIGSOFT International Symposium on Foundations of Software Engineering, in: SIGSOFT ’08/FSE-16, Association for Computing Machinery, 2008, pp. 24–35.
    [16]
    Boginski V., Butenko S., Pardalos P.M., Prokopyev O., Collaboration networks in sports, in: Butenko S., Gil-Lafuente J., Pardalos P.M. (Eds.), Economics, Management and Optimization in Sports, Springer Berlin Heidelberg, 2004, pp. 265–277.
    [17]
    Bult J.R., Wansbeek T., Optimal selection for direct mail, Mark. Sci. 14 (4) (1995) 378–394.
    [18]
    Capiluppi A., Lago P., Morisio M., Characteristics of open source projects, in: Canfora G., van den Brand M. (Eds.), Proceedings of the Seventh European Conference on Software Maintenance and Reengineering, CSMR’03, IEEE Computer Society, 2003, pp. 317–327.
    [19]
    Casciaro T., Seeing things clearly: social structure, personality, and accuracy in social network perception, Social Networks 20 (4) (1998) 331–351.
    [20]
    Chen Y.-L., Kuo M.-H., Wu S.-Y., Tang K., Discovering recency, frequency, and monetary (RFM) sequential patterns from customers’ purchasing data, Electron. Commer. Res. Appl. 8 (5) (2009) 241–251.
    [21]
    Chitraa V., Thanamani D.A.S., An enhanced clustering technique for web usage mining, Int. J. Eng. Res. Technol. (IJERT) 1 (4) (2012) 1–5.
    [22]
    Constantino K., Souza M., Zhou S., Figueiredo E., Kästner C., Perceptions of open-source software developers on collaborations: An interview and survey study, J. Softw. Evol. Process 35 (5) (2023).
    [23]
    Cross R., Ehrlich K., Dawson R., Helferich J., Managing collaboration: Improving team effectiveness through a network perspective, Calif. Manage. Rev. 50 (4) (2008) 74–98.
    [24]
    Cullen K.L., Palus C.J., Appaneal C., Developing Network Perspective: Understanding the Basics of Social Networks and their Role in Leadership [White paper], Center for Creative Leadership, 2014, p. 26.
    [25]
    Dell R.F., Román P.E., Velásquez J.D., Web user session reconstruction using integer programming, in: 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology. Vol. 1, IEEE, 2008, pp. 385–388.
    [26]
    Dougherty J., Kohavi R., Sahami M., Supervised and unsupervised discretization of continuous features, in: Prieditis A., Russell S. (Eds.), Proceedings of the Twelfth International Conference on Machine Learning, Morgan Kaufmann, San Francisco (CA), 1995, pp. 194–202.
    [27]
    Guzzo R.A., Dickson M.W., Teams in organizations: recent research on performance and effectiveness, Annu. Rev. Psychol. 47 (1) (1996) 307–338.
    [28]
    Hayashi P., Abib G., Hoppen N., Validity in qualitative research: A processual approach, Qual. Rep. 24 (1) (2019) 98–112.
    [29]
    Herbold S., Amirfallah A., Trautsch F., Grabowski J., A systematic mapping study of developer social network research, J. Syst. Softw. 171 (2021).
    [30]
    Herr, B.W., Ke, W., Hardy, E., Borner, K., 2007. Movies and Actors: Mapping the Internet Movie Database. In: 2007 11th International Conference Information Visualization. IV ’07, pp. 465–469.
    [31]
    Hong S., Lee Y., Kim J., Choi I., A methodology for redesigning an organizational structure based on business process models using SNA techniques, Int. J. Innovative Comput. Inf. Control 8 (7) (2012) 5411–5424.
    [32]
    Huang S.-K., Liu K.-m., Mining version histories to verify the learning process of legitimate peripheral participants, SIGSOFT Softw. Eng. Notes 30 (4) (2005) 1–5.
    [33]
    Huang Z., Lu X., Duan H., Resource behavior measure and application in business process management, Expert Syst. Appl. 39 (7) (2012) 6458–6468.
    [34]
    Hughes A.M., Strategic Database Marketing, McGraw-Hill Pub. Co, 2005, ISBN 9780071457507.
    [35]
    Jermakovics A., Sillitti A., Succi G., Mining and visualizing developer networks from version control systems, in: Proceedings of the 4th International Workshop on Cooperative and Human Aspects of Software Engineering, CHASE ’11, Association for Computing Machinery, 2011, pp. 24–31.
    [36]
    Johannesson P., Perjons E., An Introduction to Design Science, Springer International Publishing, 2014, ISBN 978-3-319-36110-9.
    [37]
    Jooken L., Jans M., Depaire B., Mining valuable collaborations from event data using the recency-frequency-monetary principle, in: Proceedings of the Advanced Information Systems Engineering: 34th International Conference, CAiSE 2022, Springer-Verlag, Berlin, Heidelberg, 2022, pp. 339–354.
    [38]
    Kaur N., Aggarwal H., A novel semantically-time-referrer based approach of web usage mining for improved sessionization in pre-processing of web log, Int. J. Adv. Comput. Sci. Appl. 8 (1) (2017).
    [39]
    Kohavi R., Parekh R., Visualizing RFM segmentation, in: Berry M.W., Dayal U., Kamath C., Skillicorn D. (Eds.), Proceedings of the 2004 SIAM International Conference on Data Mining, SDM, SIAM, 2004, pp. 391–399.
    [40]
    Kumar A., Dijkman R., Song M., Optimal resource assignment in workflows for maximizing cooperation, in: Daniel F., Wang J., Weber B. (Eds.), Business Process Management, Berlin, Heidelberg, Springer, 2013, pp. 235–250.
    [41]
    Kvan T., Collaborative design: what is it?, Autom. Constr. 9 (4) (2000) 409–415.
    [42]
    Lai C.-Y., Li Y.-M., Lin L.-F., A social referral appraising mechanism for the E-marketplace, Inform. Manag. 54 (3) (2017) 269–280.
    [43]
    Leyer M., Iren D., Aysolmaz B., Identification and analysis of handovers in organisations using process model repositories, Bus. Process Manag. J. 26 (6) (2020) 1599–1617.
    [44]
    Li J., Cao S., The study on high-value user identification of localized information platform, J. Phys. Conf. Ser. 1883 (1) (2021).
    [45]
    Lusher D., Robins G., Kremer P., The application of social network analysis to team sports, Meas. Phys. Educ. Exercise Sci. 14 (4) (2010) 211–224.
    [46]
    Ly L.T., Rinderle S., Dadam P., Reichert M., Mining staff assignment rules from event-based data, in: Bussler C.J., Haller A. (Eds.), Business Process Management Workshops, in: Lecture Notes in Computer Science, Springer, 2006, pp. 177–190.
    [47]
    Madey G., Freeh V., Tynan R., The open source software development phenomenon: an analysis based on social network theory, in: AMCIS 2002 Proceedings, Association for Information Systems, 2002, pp. 1806–1813.
    [48]
    McCarty J.A., Hastak M., Segmentation approaches in data-mining: A comparison of RFM, CHAID, and logistic regression, J. Bus. Res. 60 (6) (2007) 656–662.
    [49]
    Mehra A., Smith B.R., Dixon A.L., Robertson B., Distributed leadership in teams: The network of leadership perceptions and team performance, Leadersh. Q. 17 (3) (2006) 232–245.
    [50]
    Meneely A., Williams L., Socio-technical developer networks: Should we trust our measurements?, in: Proceedings of the 33rd International Conference on Software Engineering, in: ICSE ’11, Association for Computing Machinery, 2011, pp. 281–290.
    [51]
    Mitrović S., Baesens B., Lemahieu W., De Weerdt J., Churn prediction using dynamic RFM-augmented Node2vec, in: Guidotti R., Monreale A., Pedreschi D., Abiteboul S. (Eds.), Personal Analytics and Privacy. an Individual and Collective Perspective, in: Lecture Notes in Computer Science, vol. 10708, Springer International Publishing, 2017, pp. 122–138.
    [52]
    Mitrović S., Baesens B., Lemahieu W., De Weerdt J., Tcc2vec: RFM-informed representation learning on call graphs for churn prediction, Inform. Sci. 557 (2019) 1–16.
    [53]
    Mitrović S., De Weerdt J., Dyn2Vec: Exploiting dynamic behaviour using difference networks-based node embeddings for classification, in: Proceedings of the International Conference on Data Science, ICDATA’18, CSREA Press, 2019, pp. 194–200.
    [54]
    Mitrović S., Singh G., Baesens B., Lemahieu W., De Weerdt J., Scalable RFM-enriched representation learning for churn prediction, in: 2017 IEEE International Conference on Data Science and Advanced Analytics, DSAA, IEEE, 2017, pp. 79–88.
    [55]
    Otte E., Rousseau R., Social network analysis: a powerful strategy, also for the information sciences, J. Inf. Sci. 28 (6) (2002) 441–453.
    [56]
    Palinkas L.A., Horwitz S.M., Green C.A., Wisdom J.P., Duan N., Hoagwood K., Purposeful sampling for qualitative data collection and analysis in mixed method implementation research, Adm. Policy Ment. Health 42 (5) (2015) 533–544.
    [57]
    Pe-Than E.P.P., Dabbish L., Herbsleb J.D., Collaborative writing on GitHub: A case study of a book project, in: Companion of the 2018 ACM Conference on Computer Supported Cooperative Work and Social Computing, CSCW ’18, Association for Computing Machinery, 2018, pp. 305–308.
    [58]
    Pe-Than E.P.P., Dabbish L., Herbsleb J.D., Collaborative writing at scale: A case study of two open-text projects done on GitHub, in: Proceedings of the ACM Collective Intelligence Conference (CI) 2019, in: ACM Collective Intelligence Conference, ACM, 2019.
    [59]
    Pe-Than E.P.P., Dabbish L., Herbsleb J., Open collaborative writing: Investigation of the fork-and-pull model, Proc. ACM Hum.-Comput. Interact. 5 (CSCW1) (2021) 1–33.
    [60]
    Phelps C., Heidl R., Wadhwa A., Knowledge, networks, and knowledge networks: A review and research agenda, J. Manag. 38 (4) (2012) 1115–1166.
    [61]
    Pika A., Leyer M., Wynn M.T., Fidge C.J., Hofstede A.H.M.T., Aalst W.M.P.V.D., Mining resource profiles from event logs, ACM Trans. Manag. Inform. Syst. 8 (1) (2017) 1–30.
    [62]
    Pika A., Wynn M.T., Fidge C.J., ter Hofstede A.H.M., Leyer M., van der Aalst W.M.P., An extensible framework for analysing resource behaviour using event logs, in: Jarke M., Mylopoulos J., Quix C., Rolland C., Manolopoulos Y., Mouratidis H., Horkoff J. (Eds.), Advanced Information Systems Engineering, in: Lecture Notes in Computer Science, Springer International Publishing, 2014, pp. 564–579.
    [63]
    Recardo R., Wade D., Mention C., Jolly J., Teams: Who Needs Them and Why?, first ed., Gulf Publishing Company, 1996, ISBN 978-0884158523.
    [64]
    Savić D., COVID-19 and Work from Home: Digital Transformation of the Workforce, Grey J. 16 (2020) 101–104.
    [65]
    Schmidtner M., Doering C., Timinger H., Agile working during COVID-19 pandemic, IEEE Eng. Manag. Rev. 49 (2) (2021) 18–32.
    [66]
    Scott J., Carrington P., The SAGE Handbook of Social Network Analysis, in: The Sage Handbook, SAGE Publications, 2011, ISBN 9781847873958.
    [67]
    Seidman I., Interviewing As Qualitative Research: A Guide for Researchers in Education and the Social Sciences, Teachers College Press, 2006, pp. 52–54. ISBN 978-0-8077-4666-0.
    [68]
    Shulman M., GitHub HoTT book project, 2018, URL https://github.com/HoTT/book. (Accessed 17 June 2022).
    [69]
    Song M., van der Aalst W.M.P., Towards comprehensive support for organizational mining, Decis. Support Syst. 46 (1) (2008) 300–317.
    [70]
    Spiliopoulou, M., Faulstich, L.C., Winkler, K., 1999. A data miner analyzing the navigational behaviour of web users. In: Proceedings of the Workshop on Machine Learning in User Modelling of the ACAI99. Vol. 7.
    [71]
    Spiliopoulou M., Mobasher B., Berendt B., Nakagawa M., A framework for the evaluation of session reconstruction heuristics in web-usage analysis, INFORMS J. Comput. 15 (2) (2003) 171–190.
    [72]
    Tan P.-N., Kumar V., Discovery of web robot sessions based on their navigational patterns, in: Zhong N., Liu J. (Eds.), Intelligent Technologies for Information Analysis. Vol. 6, Springer, Berlin, Heidelberg, 2004, pp. 193–222.
    [73]
    The Univalent Foundations Program P.-N., Homotopy type theory: The HoTT book, 2013, URL https://homotopytypetheory.org/book/. (Accessed 17 June 2022).
    [74]
    Tymchuk Y., Mocci A., Lanza M., Collaboration in open-source projects: Myth or reality?, in: Proceedings of the 11th Working Conference on Mining Software Repositories, in: MSR 2014, Association for Computing Machinery, 2014, pp. 304–307.
    [75]
    van der Aalst W., Process mining: Data science in action, Process Mining: Data Science in Action, Springer, Berlin, Heidelberg, 2016, ISBN 978-3-662-49851-4.
    [76]
    van der Aalst W.M.P., Object-centric process mining: Dealing with divergence and convergence in event data, in: Ölveczky P.C., Salaün G. (Eds.), Software Engineering and Formal Methods, Springer International Publishing, Cham, 2019, pp. 3–25.
    [77]
    van der Aalst W.M.P., Reijers H.A., Song M., Discovering social networks from event logs, in: Computer Supported Cooperative Work. Vol. 14. No. 6, CSCW, 2005, pp. 549–593.
    [78]
    van der Aalst W.M.P., Song M., Mining social networks: Uncovering interaction patterns in business processes, in: Desel J., Pernici B., Weske M. (Eds.), Business Process Management, in: Lecture Notes in Computer Science, Springer Berlin Heidelberg, 2004, pp. 244–260.
    [79]
    Vavpotič D., Bala S., Mendling J., Hovelja T., Software process evaluation from user perceptions and log data, J. Softw. Evol. Process 34 (4) (2022).
    [80]
    Wasserman S., Faust K., et al., Social Network Analysis: Methods and Applications, Cambridge University Press, 1994, ISBN 0-521-38707-8.
    [81]
    Wolf T., Schroter A., Damian D., Nguyen T., Predicting build failures using social network analysis on developer communication, in: 2009 IEEE 31st International Conference on Software Engineering, IEEE Computer Society, 2009, pp. 1–11.
    [82]
    Xue Y., Chen J., Zhou Y., Research on user discovery based on loyalty in SNS, in: Proceedings of the 2017 International Seminar on Social Science and Humanities Research, SSHR 2017, Atlantis Press, 2017, pp. 399–406.

    Index Terms

    1. Mining Recency–Frequency–Monetary enriched insights into resources’ collaboration behavior from event data
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Information & Contributors

          Information

          Published In

          cover image Engineering Applications of Artificial Intelligence
          Engineering Applications of Artificial Intelligence  Volume 126, Issue PA
          Nov 2023
          1590 pages

          Publisher

          Pergamon Press, Inc.

          United States

          Publication History

          Published: 01 November 2023

          Author Tags

          1. Event data behavioral analytics
          2. Collaboration behavior
          3. Mining resource behavior
          4. Project mining
          5. RFM
          6. Social network analysis

          Qualifiers

          • Research-article

          Contributors

          Other Metrics

          Bibliometrics & Citations

          Bibliometrics

          Article Metrics

          • 0
            Total Citations
          • 0
            Total Downloads
          • Downloads (Last 12 months)0
          • Downloads (Last 6 weeks)0
          Reflects downloads up to 11 Aug 2024

          Other Metrics

          Citations

          View Options

          View options

          Get Access

          Login options

          Media

          Figures

          Other

          Tables

          Share

          Share

          Share this Publication link

          Share on social media