Spider4SSC & S2CLite: A text-to-multi-query-language dataset using lightweight ontology-agnostic SPARQL to Cypher parser
PositiveArtificial Intelligence
The introduction of the Spider4SSC dataset and the S2CLite parsing tool marks a significant advancement in the field of database query processing. S2CLite, designed as a lightweight and ontology-agnostic parser, translates SPARQL queries into Cypher queries without the need for an RDF graph or external tools. Its performance is noteworthy, achieving a parsing accuracy of 77.8% on the Spider4SPARQL dataset, which is a substantial improvement over the 44.2% accuracy of the previous leading tool, S2CTrans. Additionally, S2CLite demonstrated an impressive execution accuracy of 96.6% on overlapping queries, surpassing S2CTrans by 7.3%. The creation of the Spider4SSC dataset, which includes 4,525 unique questions and 2,581 matching queries across SQL, SPARQL, and Cypher, further enhances the utility of this tool. By open-sourcing S2CLite on GitHub, the developers encourage further innovation and collaboration in the community, making it a pivotal resource for researchers and practitioners wo…
— via World Pulse Now AI Editorial System