Apache Drill favicon

Apache Drill
Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage

What is Apache Drill?

Apache Drill is a schema-free SQL query engine designed for use with Hadoop, NoSQL databases, and cloud storage services. It allows users to query raw data in-situ, eliminating the need for data loading, schema creation/maintenance, or pre-processing transformations.

Drill supports a wide range of data sources, including HBase, MongoDB, HDFS, Amazon S3, Azure Blob Storage, and Google Cloud Storage. It offers a JSON-based data model that handles complex and evolving data structures and integrates with popular BI tools via JDBC and ODBC drivers.

Features

  • Schema-free Queries: Query data in-situ without needing to define schemas beforehand.
  • Data Source Flexibility: Supports a wide variety of NoSQL databases and file systems, including HBase, MongoDB, HDFS, Amazon S3, Azure Blob Storage, and Google Cloud Storage.
  • SQL Support: Utilizes standard SQL for querying, allowing users to leverage existing skills and BI tools.
  • JSON Data Model: Handles complex/nested data and evolving structures.
  • Columnar Execution Engine: Optimizes query performance with an in-memory shredded columnar representation.
  • Data Locality Awareness: Reduces network traffic by co-locating with the data store.
  • Datastore-Aware Optimizer: Restructures query plans to leverage the datastore's internal processing.
  • JDBC/ODBC Drivers: Enables integration with BI tools like Tableau, Qlik, MicroStrategy, and Excel.

Use Cases

  • Querying raw data in Hadoop directories.
  • Joining data across multiple datastores, such as MongoDB and Hadoop.
  • Analyzing user profiles in MongoDB combined with event logs in Hadoop.
  • Directly querying data stored in Amazon S3 buckets.
  • Connecting NoSQL databases to BI tools for visualization and analysis.
  • Developing custom applications with visualizations using the REST API.
  • Rapid data exploration on a laptop or scaled to large clusters.

FAQs

  • What datastores does Apache Drill support?
    Drill supports a variety of NoSQL databases and file systems, including HBase, MongoDB, MapR-DB, HDFS, MapR-FS, Amazon S3, Azure Blob Storage, Google Cloud Storage, Swift, NAS and local files.
  • Can I use BI tools with Apache Drill?
    Yes, Drill supports standard SQL and provides JDBC and ODBC drivers, allowing integration with BI tools like Tableau, Qlik, MicroStrategy, Spotfire, SAS and Excel.
  • Does Drill require schema definition before querying?
    No, Drill is a schema-free query engine. You can query raw data directly without pre-defining schemas.

Related Queries

Helpful for people in the following professions

Blogs:

  • Long Videos into Viral Shorts

    Long Videos into Viral Shorts

    Klap.app is an AI-powered video editing tool that transforms long-form videos into engaging short clips optimized for platforms like TikTok, Instagram Reels, and YouTube Shorts

  • Chat with PDF AI Tools

    Chat with PDF AI Tools

    Easily interact with your PDF documents using our advanced AI-powered tool. Whether you're reading lengthy reports, research papers, contracts, or eBooks, our platform lets you chat directly with your PDF files, ask questions, extract insights, and get summaries in real-time.

  • AI tools for video voice overs

    AI tools for video voice overs

    Discover the next level of video production with AI-powered voiceover tools. Enhance your content effortlessly, ensuring professional-quality narration for your videos.

  • Best Content Automation AI tools

    Best Content Automation AI tools

    Streamline your content creation process, enhance productivity, and elevate the quality of your output effortlessly. Harness the power of cutting-edge automation technology for unparalleled results

Didn't find tool you were looking for?

Be as detailed as possible for better results