There are various technologies on top of Hadoop such as HBase, Hive, Pig and
more. I was wondering what are the differences between them. What are the
usage scenarios that fit each one of them.
For instance, is it true to say that Pig and Hive belong to the same family
? Or is Hive more close to HBase ?
My understanding is that HBase allows direct lookup and low latency queries,
while Pig and Hive provide batch processing operations which are M/R based.
Both define a data model and an SQL-like query language. Is this true ?
Could anyone shed light on when to use each technology ? Main differences ?
Pros and Cons ?
Information on other technologies such as Jaql is also welcome.
oo 00 oo 00 oo 00 oo 00 oo 00 oo 00 oo 00 oo 00 oo 00 oo 00 oo 00 oo 00 oo
00 oo 00 oo
"If you want your children to be intelligent, read them fairy tales. If you
want them to be more intelligent, read them more fairy tales." (Albert