The purpose of these datasets is to support equivalence and subsumption ontology matching. There are five ontology pairs extracted from MONDO and UMLS: Source Ontology Pair Category MONDO OMIM-ORDO Disease MONDO NCIT-DOID Disease UMLS SNOMED-FMA Body UMLS SNOMED-NCIT Pharm UMLS SNOMED-NCIT Neoplas Each pair is associated with three folders: "raw_data", "equiv_match", and "subs_match", corresponding to the downloaded source ontologies, the package for equivalence matching, and the package for subsumption matching. See detailed documentation at: https://krr-oxford.github.io/DeepOnto/#/om_resources. See the incoming OAEI Bio-ML track at: https://www.cs.ox.ac.uk/isg/projects/ConCur/oaei/. See our resource paper at: https://arxiv.org/abs/2205.03447.
M. Glauer, F. Neuhaus, T. Mossakowski, and J. Hastings. German conference on artificial intelligence 2023, volume 14236 of Lecture Notes in Artificial Intelligence, page 31-45. Springer, (2023)Best paper award. Also available at https://doi.org/10.48550/arXiv.2301.08577.
P. Missier, K. Belhajjame, and J. Cheney. Proceedings of the 16th International Conference on Extending Database Technology, page 773–776. New York, NY, USA, Association for Computing Machinery, (2013)
O. Vsesviatska, T. Tietz, F. Hoppe, M. Sprau, N. Meyer, D. Dessì, and H. Sack. Proceedings of the 36th Annual ACM Symposium on Applied Computing (ACM SAC), page 1855--1863. Association for Computing Machinery, (2021)event-place: Virtual Conference.