Sqoop

From Wikipedia, the free encyclopedia

Sqoop is a command-line interface application for transferring data between relational databases and Hadoop.[1]

Quick Facts Developer(s), Initial release ...
Apache Sqoop
Developer(s)Apache Software Foundation
Initial release1 June 2009; 15 years ago (2009-06-01)
Final release
1.4.7 / December 6, 2017; 7 years ago (2017-12-06)
RepositorySqoop Repository
Written inJava
Operating systemCross-platform
TypeData management
LicenseApache License 2.0
Websitesqoop.apache.org
Close

The Apache Sqoop project was retired in June 2021 and moved to the Apache Attic.[2]

Description

Sqoop supports incremental loads of a single table or a free form SQL query as well as saved jobs which can be run multiple times to import updates made to a database since the last import. Imports can also be used to populate tables in Hive or HBase.[3] Exports can be used to put data from Hadoop into a relational database. Sqoop got the name from "SQL-to-Hadoop".[4] Sqoop became a top-level Apache project in March 2012.[5]

Informatica provides a Sqoop-based connector from version 10.1. Pentaho provides open-source Sqoop based connector steps, Sqoop Import[6] and Sqoop Export,[7] in their ETL suite Pentaho Data Integration since version 4.5 of the software.[8] Microsoft uses a Sqoop-based connector to help transfer data from Microsoft SQL Server databases to Hadoop.[9] Couchbase, Inc. also provides a Couchbase Server-Hadoop connector by means of Sqoop.[10]

See also

References

Bibliography

Loading related searches...

Wikiwand - on

Seamless Wikipedia browsing. On steroids.