The Teradata Connector for Hadoop (TDCH) provides scalable, high-performance, bi-directional data movement between the Teradata database system and the Hadoop system.
For more detailed information on the Teradata Connector for Hadoop, please see the Tutorial document in the "Teradata Connector for Hadoop Now Available" article, as well as the README file in the appropriate TDCH download package. The Tutorial document mainly covers TDCH (Command Line Edition). The download packages are for use on commodity hardware; for Teradata appliance hardware, TDCH is distributed with the appliance. Teradata CS supports TDCH in certain situations where the user is a Teradata customer.
For more information about Hadoop Product Management (PM), Teradata employees can go to Teradata Connections Hadoop PM.
Thanks for the update, glad to see active work being done on TDCH.
If you are taking suggestions for the next release ...
1. Currently the name of the table being loaded cannot exceed 24 characters, because load jobs append six characters (_ERR_1 and _ERR_2) to it. This is a big constraint where table names longer than 24 characters already exist. Please work with the JDBC team to provide an option to specify the error database name and error table name for FastLoad. (important to have)
2. Support for query bands for the entire process, and specifically for the load and export operators, since TASM regulates the number of sessions in most Teradata shops. (important to have)
3. Ability to provide path where users can have pre and post load/export SQL. (nice to have)
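To make the arithmetic in suggestion 1 concrete, here is a minimal sketch of the length check. It assumes a 30-character object-name limit, which is only inferred from the 24 + 6 figures above, and the helper names are hypothetical; none of this is part of TDCH or the JDBC driver.

```python
# Assumption: a 30-character object-name limit, inferred from the
# "24 characters + six-character suffix" figures in the comment above.
MAX_OBJECT_NAME_LEN = 30

# Suffixes that the load job appends to the target table name.
ERROR_TABLE_SUFFIXES = ("_ERR_1", "_ERR_2")


def error_table_names(table_name):
    """Return the error table names derived from the target table name."""
    return [table_name + suffix for suffix in ERROR_TABLE_SUFFIXES]


def fits_fastload_naming(table_name, max_len=MAX_OBJECT_NAME_LEN):
    """True if every derived error table name stays within the limit."""
    return all(len(name) <= max_len for name in error_table_names(table_name))


def max_target_name_len(max_len=MAX_OBJECT_NAME_LEN):
    """Longest target table name that still leaves room for the suffixes."""
    return max_len - max(len(suffix) for suffix in ERROR_TABLE_SUFFIXES)
```

Under that assumption, a 24-character table name passes the check and a 25-character name does not, which is the constraint described in suggestion 1.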
I would like to know who is using TDCH, and what stage people are in with respect to their deployment.
Please send me an email (firstname.lastname@example.org) and let me know. If you are a Teradata customer, please include your customer name.
A new README file has been uploaded to the appropriate packages, with updates to Sections 2.2, 2.3, and 8.5.