Apache Impala

Apache Impala is an open source massively parallel processing (MPP) SQL query engine for data stored in a computer cluster running Apache Hadoop.[2] Impala has been described as the open-source equivalent of Google F1, which inspired its development in 2012.[3]

Apache Impala
Developer(s): Apache Software Foundation
Initial release: April 28, 2013
Stable release: 4.1.0 / June 28, 2022[1]
Repository: Impala Repository
Written in: C++, Java
Operating system: Cross-platform
Type: Relational Hadoop-analytics
License: Apache License 2.0
Website: impala.apache.org

Description

Apache Impala is a query engine that runs on Apache Hadoop. The project was announced in October 2012 with a public beta test distribution[4][5] and became generally available in May 2013.[6]

Impala brings scalable parallel database technology to Hadoop, enabling users to issue low-latency SQL queries to data stored in HDFS and Apache HBase without requiring data movement or transformation. Impala is integrated with Hadoop to use the same file and data formats, metadata, security and resource management frameworks used by MapReduce, Apache Hive, Apache Pig and other Hadoop software.
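The parallel-processing idea behind an MPP engine can be illustrated with a small sketch. The following is not Impala code and does not use any Impala API; it is a toy model, assuming hypothetical in-memory partitions that stand in for data blocks held on separate nodes. Each partition is aggregated locally and the partial results are then merged, which is roughly the shape of a distributed GROUP BY.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical data: each inner list stands in for the rows stored on one node.
PARTITIONS = [
    [("us", 3), ("de", 1)],
    [("us", 2), ("fr", 5)],
    [("de", 4), ("fr", 1)],
]


def partial_sum(rows):
    """Aggregate a single partition locally (the per-node step)."""
    totals = Counter()
    for country, n in rows:
        totals[country] += n
    return totals


def parallel_group_sum(partitions):
    """Run the local aggregation on every partition, then merge the partials."""
    with ThreadPoolExecutor() as pool:
        partials = list(pool.map(partial_sum, partitions))
    merged = Counter()
    for part in partials:
        merged.update(part)  # Counter.update adds counts key by key
    return dict(merged)


result = parallel_group_sum(PARTITIONS)
```

A real engine distributes this work across machines and reads from storage such as HDFS rather than from Python lists; the threads here only stand in for that distribution.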

Impala is promoted for analysts and data scientists to perform analytics on data stored in Hadoop via SQL or business intelligence tools. As a result, large-scale data processing (via MapReduce) and interactive queries can run on the same system using the same data and metadata, removing the need to migrate data sets into specialized systems or proprietary formats simply to perform analysis.

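Features include:

  • Supports HDFS, S3, ABFS, Apache HBase and Apache Kudu storage
  • Reads Hadoop file formats, including text, LZO, SequenceFile, Avro, RCFile, Parquet and ORC
  • Supports Hadoop security (Kerberos authentication, LDAP)
  • Fine-grained, role-based authorization with Apache Sentry and Apache Ranger
  • Uses metadata, ODBC driver, and SQL syntax from Apache Hive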

In early 2013, a column-oriented file format called Parquet was announced for architectures including Impala.[7] In December 2013, Amazon Web Services announced support for Impala.[8] In early 2014, MapR added support for Impala.[9] In 2015, a columnar storage engine called Kudu was announced, which Cloudera proposed to donate to the Apache Software Foundation along with Impala.[10] Impala graduated to an Apache Top-Level Project (TLP) on 28 November 2017.[11]

See also

  • Apache Drill — similar open source project inspired by Dremel
  • Dremel — similar tool from Google
  • Trino — open source SQL query engine created by the creators of Presto
  • Presto — open source SQL query engine created by Facebook and supported by Teradata

References

  1. ^ @ApacheImpala (June 27, 2022). "The Apache Impala team is pleased to announce the release of Impala 4.1.0" (Tweet) – via Twitter.
  2. ^ "Apache Impala". Retrieved 15 September 2017.
  3. ^ Cade Metz (October 24, 2012). "Man Busts Out of Google, Rebuilds Top-Secret Query Machine". Wired Magazine. Retrieved October 10, 2016.
  4. ^ Larry Dignan (October 24, 2012). "Cloudera aims to bring real-time queries to Hadoop, big data". Between the lines blog. ZDNet. Retrieved January 20, 2014.
  5. ^ Andrew Brust (October 25, 2012). "Cloudera's Impala brings Hadoop to SQL and BI". ZDNet. Retrieved January 20, 2014.
  6. ^ Marcel Kornacker, Justin Erickson (May 1, 2013). "Cloudera Impala 1.0: It's Here, It's Real, It's Already the Standard for SQL on Hadoop". Archived from the original on April 13, 2014. Retrieved April 10, 2014.
  7. ^ "Parquet: Columnar Storage for Hadoop". Project web site. 2013. Retrieved January 20, 2014.
  8. ^ "Announcing Support for Impala with Amazon Elastic MapReduce". Amazon.com. December 12, 2013. Retrieved January 20, 2014.
  9. ^ "Impala for MapR". MapR.com. February 2, 2014. Retrieved April 10, 2014.
  10. ^ David Ramel (November 18, 2015). "Cloudera to Donate Impala and Kudu Big Data Projects to Apache". Application Development Trends. Retrieved October 10, 2016.
  11. ^ "The Apache Software Foundation Announces Apache Impala as a Top-Level Project". November 28, 2017. Retrieved November 30, 2017.

External links

  • Apache Impala project website
  • Impala GitHub project source code
