Questions on performance #7

sagpid · 2014-04-23T21:43:39Z

Hi,

We download the code and were able to make it work both on a localhost deployment of cassandra and a remote deployment. Thanks a lot of the great piece of work that you have shared, and it has saved us a lot of time and effort.

Please find my questions below on performance.

About 275 map jobs are started in hadoop when a simple select count(*) is issued on the hive. This slows down the query enormously if the query is issued on hive on a external table which is located on cassandra. ( about 30 minutes for 150 records)
If I create hive table from external cassandra table it is very slow. ( About 30 minutes.

Is there a work around or something to be expected from hive side.

thanks

Sagar

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Questions on performance #7

Questions on performance #7

sagpid commented Apr 23, 2014

Questions on performance #7

Questions on performance #7

Comments

sagpid commented Apr 23, 2014