INTEGRATING GRAPH DATABASE MODELING INTO AGILE SYSTEM ANALYSIS AND DESIGN: A CONCEPTUAL FRAMEWORK FOR SCALABLE INFORMATION SYSTEMS
DOI:
https://doi.org/10.53555/wfehqv74Keywords:
graph database modeling, agile system analysis and design, property graph model, schema evolution, scalable information systems, iterative developmentAbstract
Relational database modeling is still the most popular in system analysis and design (SAD), but has structural constraints in relationship-intensive domains like social networks, fraud detection, recommendation systems, and knowledge graphs (Angles & Gutierrez, 2008; Robinson et al., 2015). These constraints are aggravated in agile systems with iterative delivery and emergent design (Beck et al., 2001; Fowler, 2002; Schwaber & Sutherland, 2017), in which architectural decisions that are susceptible to traversal are often deferred.
This study presents the Graph-Agile Framework, a design science artifact that brings the concepts of graph database modeling into agile SAD using a three-tier architecture: Strategic Alignment, Tactical Integration, and Operational Adaptation. Grounded in design science research and theory synthesis (Gregor & Hevner, 2013; Hevner et al., 2004; Jaakkola, 2020a), the framework institutionalizes traversal-aware reasoning, schema optionality, and sprint-level scalability validation into the iterative workflows.
An expert-based evaluation (N = 40) assessed structural clarity, theoretical grounding, agile alignment, scalability support, and adoption feasibility. Findings indicate consistently high expert-rated internal coherence and compatibility with agile practices, providing preliminary perceptual support for the framework’s utility.
The study advances SAD theory beyond relational assumptions and extends agile methodology by formalizing traversal-aware architectural governance for scalable graph-centric system development.
References
1.Aalabaf-Sabaghi, M. (2012). Networks, Crowds and Markets: Reasoning about a Highly Connected World. Journal of the Royal Statistical Society Series A: Statistics in Society, 175(4). https://doi.org/10.1111/j.1467-985x.2012.01069_4.x
2.Ambler, S., & Sadalage, P. (2006). Refactoring databases: Evolutionary database design. In Queue (Vol. 4, Number 7).
3.Ambler, S. W. (2003). Agile Database Techniques — Effective Strategies for the Agile Software Developer. In Database.
4.Angles, R. (2012). A comparison of current graph database models. Proceedings - 2012 IEEE 28th International Conference on Data Engineering Workshops, ICDEW 2012. https://doi.org/10.1109/ICDEW.2012.31
5.Angles, R., & Gutierrez, C. (2008). Survey of graph database models. ACM Computing Surveys, 40(1). https://doi.org/10.1145/1322432.1322433
6.Beck, K. (1999). Extreme Programming Explained: Embrace Change. In XP Series.
7.Beck, K., Beedle, M., Bennekum, A. Van, Cockburn, A., Cunningham, W., Fowler, M., Grenning, J., Highsmith, J., Hunt, A., Jeffries, R., Kern, J., Marick, B., Martin, R. C., Mellor, S., Schwaber, K., Sutherland, J., & Thomas, D. (2001). Manifesto for Agile Software Development. The Agile Alliance.
8.Boehm, B. (2002). Get ready for agile methods, with care. Computer, 35(1). https://doi.org/10.1109/2.976920
9.Bonifati, A., Fletcher, G., Voigt, H., & Yakovets, N. (2018). Querying Graphs. Synthesis Lectures on Data Management, 10(3). https://doi.org/10.2200/s00873ed1v01y201808dtm051
10.Burton-Jones, A., Recker, J., Indulska, M., Green, P., & Weber, R. (2017). Assessing representation theory with a framework for pursuing success and failure. MIS Quarterly: Management Information Systems, 41(4). https://doi.org/10.25300/MISQ/2017/41.4.13
11.Chen, P. P. S. (1976). The Entity-Relationship Model—toward a Unified View of Data. ACM Transactions on Database Systems (TODS), 1(1). https://doi.org/10.1145/320434.320440
12.Cockburn, A. (2006). Agile software development: the cooperative game. Building, 113.
13.Codd, E. F. (1970). A Relational Model of Data for Large Shared Data Banks. Communications of the ACM, 13(6). https://doi.org/10.1145/362384.362685
14.Cohn, M. (2004). User Stories Applied: For Agile Software Development (Addison Wesley Signature Series). In Writing (Vol. 1, Number 0).
15.Date, C. J. (2003). An Introduction to Database Systems 8e. In Pearson.
16.De Virgilio, R., Maccioni, A., & Torlone, R. (2014). Model-driven design of graph databases. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8824. https://doi.org/10.1007/978-3-319-12206-9_14
17.Dingsøyr, T., Nerur, S., Balijepally, V., & Moe, N. B. (2012). A decade of agile methodologies: Towards explaining agile software development. In Journal of Systems and Software (Vol. 85, Number 6). https://doi.org/10.1016/j.jss.2012.02.033
18.Dominguez-Sal, D., Urbón-Bayes, P., Giménez-Vañó, A., Gómez-Villamor, S., Martínez-Bazán, N., & Larriba-Pey, J. L. (2010). Survey of graph database performance on the HPC scalable graph analysis benchmark. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 6185 LNCS. https://doi.org/10.1007/978-3-642-16720-1_4
19.Dong, H., Dacre, N., Baxter, D., & Ceylan, S. (2024). What is Agile Project Management? Developing a New Definition Following a Systematic Literature Review. Project Management Journal, 55(6). https://doi.org/10.1177/87569728241254095
20.Elmasri, Ramez and Navathe, S. B. (2016). Fundamentals of database systems seventh edition. In Webseiten entwickeln mit ASP.NET.
21.Fensel, D., Şimşek, U., Angele, K., Huaman, E., Kärle, E., Panasiuk, O., Toma, I., Umbrich, J., & Wahler, A. (2020). Knowledge Graphs: Methodology, Tools and Selected Use Cases. In Knowledge Graphs: Methodology, Tools and Selected Use Cases. https://doi.org/10.1007/978-3-030-37439-6
22.Fowler, M. (2002). Refactoring: Improving the design of existing code. Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2418. https://doi.org/10.1007/3-540-45672-4_31
23.Fowler, M. (2004). UML Distilled: A Brief Guide to the Standard Object Modeling Language. Pearson Paravia Bruno Mondad.
24.Francis, N., Green, A., Guagliardo, P., Libkin, L., Lindaaker, T., Marsault, V., Plantikow, S., Rydberg, M., Selmer, P., & Taylor, A. (2018). Cypher: An evolving query language for property graphs. Proceedings of the ACM SIGMOD International Conference on Management of Data. https://doi.org/10.1145/3183713.3190657
25.Gregor, S. (2006). The nature of theory in Information Systems. MIS Quarterly: Management Information Systems, 30(3). https://doi.org/10.2307/25148742
26.Gregor, S., & Hevner, A. R. (2013). Positioning and presenting design science research for maximum impact. In MIS Quarterly: Management Information Systems (Vol. 37, Number 2). https://doi.org/10.25300/MISQ/2013/37.2.01
27.Hevner, A. R., March, S. T., Park, J., & Ram, S. (2004). Design science in information systems research. MIS Quarterly: Management Information Systems, 28(1). https://doi.org/10.2307/25148625
28.Holzschuher, F., & Peinl, R. (2013). Performance of graph query languages: Comparison of cypher, gremlin and native access in Neo4j. ACM International Conference Proceeding Series. https://doi.org/10.1145/2457317.2457351
29.Iosup, A., Hegeman, T., Ngai, W. L., Heldens, S., Pérez, A. P., Manhardt, T., Chafi, H., Capotă, M., Sundaram, N., Anderson, M., Tănase, I. G., Xia, Y., Nai, L., & Boncz, P. (2015). LDBC graphalytics: A benchmark for large scale graph analysis on parallel and distributed platforms. Proceedings of the VLDB Endowment, 9(13). https://doi.org/10.14778/3007263.3007270
30.Jaakkola, E. (2020a). Designing conceptual articles: four approaches. AMS Review, 10(1–2), 18–26. https://doi.org/10.1007/s13162-020-00161-0
31.Jaakkola, E. (2020b). Designing conceptual articles: four approaches. AMS Review, 10(1–2), 18–26. https://doi.org/10.1007/s13162-020-00161-0
32.Nathan, A. J., & Scobell, A. (2017). OMG Unified Modeling Language, Version 2.5.1. Foreign Affairs, 91(5).
33.Needham, M., & Hodler, A. E. (2020). Graph algorithms: Practical examples in Apache Spark & Neo4j. In Angewandte Chemie International Edition, 6(11), 951–952.
34.Newman, M. E. J. (2018). Network structure from rich but noisy data. In Nature Physics (Vol. 14, Number 6). https://doi.org/10.1038/s41567-018-0076-1
35.P. Abrahamson, Outi Salo, Jussi Ronkainen, & Juhani Warsta. (2002). Agile software development methods: Review and analysis. VTT Publications.
36.Pavlopoulos, G. A., Secrier, M., Moschopoulos, C. N., Soldatos, T. G., Kossida, S., Aerts, J., Schneider, R., & Bagos, P. G. (2011). Using graph theory to analyze biological networks. In BioData Mining (Vol. 4, Number 1). https://doi.org/10.1186/1756-0381-4-10
37.Rajaraman, A., & Ullman, J. D. (2011). Mining of massive datasets. In Mining of Massive Datasets (Vol. 9781107015357). https://doi.org/10.1017/CBO9781139058452
38.Robinson, I., Webber, J., & Eifrem, E. (2015). Graph Databases: New Opportunities for Connected Data. In Information Management.
39.Schwaber, K., & Sutherland, J. (2017). The Scrum Guide: The Definitive The Rules of the Game. Scrum.Org and ScrumInc, (November).
40.Vicknair, C., Macias, M., Zhao, Z., Nan, X., Chen, Y., & Wilkins, D. (2010). A comparison of a graph database and a relational database. https://doi.org/10.1145/1900008.1900067
41.Wand, Y., & Weber, R. (1995). On the deep structure of information systems. Information Systems Journal, 5(3). https://doi.org/10.1111/j.1365-2575.1995.tb00108.x
42.Wand, Y., & Weber, R. (2002). Research commentary: Information systems and conceptual modeling - A research agenda. Information Systems Research, 13(4). https://doi.org/10.1287/isre.13.4.363.69
43.Whetten, D. A. (1989). What Constitutes a Theoretical Contribution? The Academy of Management Review, 14(4). https://doi.org/10.2307/258554


