Apache Drill favicon

Apache Drill
Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage

What is Apache Drill?

Apache Drill is a schema-free SQL query engine designed for use with Hadoop, NoSQL databases, and cloud storage services. It allows users to query raw data in-situ, eliminating the need for data loading, schema creation/maintenance, or pre-processing transformations.

Drill supports a wide range of data sources, including HBase, MongoDB, HDFS, Amazon S3, Azure Blob Storage, and Google Cloud Storage. It offers a JSON-based data model that handles complex and evolving data structures and integrates with popular BI tools via JDBC and ODBC drivers.

Features

  • Schema-free Queries: Query data in-situ without needing to define schemas beforehand.
  • Data Source Flexibility: Supports a wide variety of NoSQL databases and file systems, including HBase, MongoDB, HDFS, Amazon S3, Azure Blob Storage, and Google Cloud Storage.
  • SQL Support: Utilizes standard SQL for querying, allowing users to leverage existing skills and BI tools.
  • JSON Data Model: Handles complex/nested data and evolving structures.
  • Columnar Execution Engine: Optimizes query performance with an in-memory shredded columnar representation.
  • Data Locality Awareness: Reduces network traffic by co-locating with the data store.
  • Datastore-Aware Optimizer: Restructures query plans to leverage the datastore's internal processing.
  • JDBC/ODBC Drivers: Enables integration with BI tools like Tableau, Qlik, MicroStrategy, and Excel.

Use Cases

  • Querying raw data in Hadoop directories.
  • Joining data across multiple datastores, such as MongoDB and Hadoop.
  • Analyzing user profiles in MongoDB combined with event logs in Hadoop.
  • Directly querying data stored in Amazon S3 buckets.
  • Connecting NoSQL databases to BI tools for visualization and analysis.
  • Developing custom applications with visualizations using the REST API.
  • Rapid data exploration on a laptop or scaled to large clusters.

FAQs

  • What datastores does Apache Drill support?
    Drill supports a variety of NoSQL databases and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage, Swift, NAS and local files.
  • Can I use BI tools with Apache Drill?
    Yes, Drill supports standard SQL and provides JDBC and ODBC drivers, allowing integration with BI tools like Tableau, Qlik, MicroStrategy, Spotfire, SAS and Excel.
  • Does Drill require schema definition before querying?
    No, Drill is a schema-free query engine. You can query raw data directly without pre-defining schemas.

Related Queries

Helpful for people in the following professions

Blogs:

  • Best AI tools for recruiters

    Best AI tools for recruiters

    These tools use advanced algorithms and machine learning to automate tasks such as resume screening, candidate matching, and predictive analytics. By analyzing vast amounts of data quickly and efficiently, AI tools help recruiters make data-driven decisions, save time, and identify the best candidates for open positions.

  • Boost Engagement in Ads with AI

    Boost Engagement in Ads with AI

    Discover how AI music and AI SDR agents are reshaping modern advertising. Learn how emotional resonance through AI-generated soundtracks combined with smart, automated sales outreach can turn viewers into loyal customers faster, cheaper, and more personally than ever before.

  • Best AI tools for Room Design

    Best AI tools for Room Design

    Discover cutting-edge AI tools that redefine the art of room design. From layout optimization to aesthetic finesse, these top-tier tools enhance your space to new heights.

Didn't find tool you were looking for?

Be as detailed as possible for better results