Covering Scientific & Technical AI | Saturday, December 21, 2024

MarkLogic Updates Connector for Hadoop 

MarkLogic Corporation today announced a significant update to its Connector for Hadoop that allows Hadoop applications direct access to data indexed and managed by the MarkLogic Enterprise NoSQL database platform. The Connector helps enterprises realize the value of Hadoop by simplifying data management, reducing infrastructure costs, and increasing development agility. Using the Connector, a Hadoop application can directly read all of the data from MarkLogic's compressed data files stored in the Hadoop Distributed File System (HDFS*), without communicating through a MarkLogic database or exporting the data.

With MarkLogic running on Hadoop, indexes are created once and can be used over the life of the data for secure, real-time, transactional queries and updates as well as large-scale batch analysis using MapReduce. This helps to reduce storage and infrastructure costs normally associated with siloed data marts and special-purpose analytic environments. Having fewer copies of the data also simplifies data governance, reducing risk. This is especially critical to organizations in highly regulated industries -- such as financial services, healthcare, and the public sector -- that are aggressively moving to Hadoop for next-generation data management infrastructure.

The MarkLogic Connector for Hadoop is the latest milestone in MarkLogic's ongoing effort to bring more value to customers leveraging Hadoop technology. Last year, the company unveiled its Tiered Storage strategy and announced new tiered storage features in its recently announced MarkLogic 7.

MarkLogic's new Tiered Storage offering allows customers to deploy the MarkLogic database platform using a mix of locally attached SSD and spinning disk, SAN, NAS, S3, and HDFS storage within the same database. Administrators can move data consistently between tiers with full transactional guarantees and zero downtime. This pioneering approach can reduce storage costs, while making it easy to incorporate Hadoop into an enterprise architecture. Combined with MarkLogic's schema-agnostic data model, MarkLogic Tiered Storage provides unprecedented flexibility to make smarter tradeoffs in live systems among cost, performance, and availability without having to change application code.

AIwire