ArangoDB 3.4 Introduces Native Search Engine and Full GeoJSON Support
Over 40 New Features Added to Simplify Application Development
San Francisco, CA, United States, December 6th, 2018 – ArangoDB, the leading open source native multi-model database, today announced the GA release of ArangoDB 3.4 – a transactional database solution which enables developers to efficiently interact with multiple data models by using just one technology and one query language. Major new enhancements in ArangoDB 3.4 include ArangoSearch, a feature which transforms ArangoDB, when combined with traversals or joins in AQL, from a data retrieval to an information retrieval solution; and full GeoJSON Support enabled by a Google S2 Geo Index library integration.
ArangoSearch, the result of four years of research and development, combines Boolean and generalized ranking retrieval models (e.g. vector space model). Providing a rich set of information retrieval capabilities, ArangoSearch consists of 2 components – a search engine and an integration layer. The former is responsible for managing the index, querying and scoring, whereas the latter provides search capabilities for the end user in a convenient way. ArangoSearch can be combined with all three data models in ArangoDB. If used in conjunction with graph database capabilities, search results could be used, for example, to enhance fraud protection, individualize recommendations or simplify precision medicine.
Search uses a special kind of materialized view to enable full-text search on multiple collections at once. Within the view definition one can specify entire collections or individual fields that should be covered by an inverted index using one or several general text analyzers. In search queries expressed with AQL, you can rank the results using multiple scorers (TFIDF and BM25) even combined. Users can now perform relevance-based matching, phrase and prefix matching, search with complex Boolean expressions, query time relevance tuning and combine complex traversals, geo-queries, and other access patterns with information retrieval techniques.
ArangoDB 3.4 includes full support for GeoJSON, an open standard format designed for representing simple geographical features, along with their non-spatial attributes. The support encompasses all geo primitives, including multi-polygons or multi-line strings. In 3.4 there has been a distinct engineering focus on increasing query and filtering functionality and optimizing performance. To this end, 3.4 also includes a Google S2 Geometry Library integration which complements ArangoDB’s RocksDB storage engine. Additionally, users can directly visualize results in OpenStreetMap which is integrated into the Query Editor of ArangoDBs WebUI.
Other notable enhancements in ArangoDB 3.4 include:
- Query Profiler: to provide developers with more insight into complex queries, it is now possible to execute the query with special instrumentation code enabled resulting in a printed query plan with detailed execution statistics. It is now much easier to profile your queries and get insights into how much time was spent where.
- Cluster Management: enhancements include faster cluster startup, synchronization and query execution. To increase the reliability and predictability of the ArangoDB cluster, internal protocols and request handling have been significantly overhauled to improve cluster-wide query execution, an example being Distributed Collect.
- Streaming Cursors: at times the overall query performance is not a major priority, but rather how fast a user can obtain first results. Based on community feedback, 3.4 includes integrated streaming cursors which provides first results as they become available on the server.
- RocksDB is now the default Storage Engine: previous versions of ArangoDB used MMfiles as the default storage engine. With 3.4, this has changed to RocksDB. This provides numerous advantages to the user including optimized binary storage format, optional caching, reduced replication catch-up time, an exclusive collection access option, and enhanced WAL sync control.
Claudius Weinberger, CEO of ArangoDB, said: “Improved usability and enhanced application performance are at the heart of every release we deliver. We are constantly reviewing the functionality of our native multi-model solution to ensure it competes, and in many cases outperforms, single-model alternatives. In 3.4 we have introduced ArangoSearch and extended the geo-spatial search capabilities of our database, which is a huge step forward for our technology. Our commitment to innovation is reflected in the quantity of new features available in this release.”
About ArangoDB Inc.
One database, one query language and three data models. With more than 6 million downloads and over 6,800 stargazers on Github, ArangoDB is the leading native multi-model database. It combines the power of graphs, with JSON documents and a key-value store. ArangoDB lets you access and combine all of these data models with a single elegant, declarative query language.
ArangoDB is the simple, versatile and performant answer to many challenges facing developers, startups and enterprises today and in the future. Simplifying complexity and increasing productivity is the mission of ArangoDB Inc., the company behind the project.
Jan Stücke, Head of Communications
Phone: +49(0) 221 2722 999-60
ArangoDB Inc., 548 Market St
#61436, San Francisco, CA, 94104-5401