DOMAIN-SPECIFIC LANGUAGE AND MODEL-DRIVEN ENVIRONMENT FOR ACCESSING DIVERSE VECTOR DATABASES
DOI:
https://doi.org/10.24867/29BE19AkikKeywords:
Vector Database, Domain-Specific Language, Model-Driven Software Development, Big Data, Machine LearningAbstract
In this paper, a model-driven environment for uniform access to various vector databases is presented. The environment supports connection to the target vector database, definition of data structures, manipulation of database objects, querying, and vector similarity search. The central element of the environment is the domain-specific language uniVEC, with a textual syntax. An important part of the environment is code generators, that transform models created by using uniVEC into executable programming code. The code generators generate Python scripts to support work with one of the chosen vector databases, such as: Pinecone, Milvus, Chroma, Weaviate, or Qdrant. The proposed environment aims to simplify work with vector databases, reduce learning time and support migrations between different vector databases.
References
[2] T. Taipalus, “Vector database management systems: Fundamental concepts, use-cases, and current challenges,” Cogn. Syst. Res., vol. 85, p. 101216, Jun. 2024, doi: 10.1016/j.cogsys.2024.101216.
[3] M. Brambilla, J. Cabot, and M. Wimmer, Model-Driven Software Engineering in Practice, 2nd ed., San Rafael, USA: Morgan & Claypool Publishers, 2017.
[4] M. Mernik, J. Heering, and A. M. Sloane, “When and how to develop domain-specific languages,” ACM Comput. Surv., vol. 37, no. 4, pp. 316–344, Dec. 2005, doi: 10.1145/1118890.1118892.
[5] D. Steinberg, F. Budinsky, M. Paternostro, E. Merks, EMF: Eclipse Modeling Framework, 2nd ed., Upper Saddle River, NJ: Addison-Wesley, 2009.
[6] L. Bettini and S. Efftinge, Implementing domain-specific languages with Xtext and Xtend, 2nd ed., Birmingham Mumbai: Packt Publishing, 2016.