HIVE VS RDBMS

--

Hive
+++++
— Large datasets
— Parallel computation
— High latency
— Mostly read operations
— HiveQL
— Schema on read, no constraints enforced
— Minimal index support
— Row-level updates and deletes only as a special case (transactional tables)
— Many more built-in functions
— Only equi-joins allowed
— Restricted subqueries
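
To make "schema on read" concrete, here is a minimal Python sketch. It is not Hive code. The sample data, the `SCHEMA` list, and the helper names are all hypothetical, chosen only for illustration. The raw text is stored untouched, and types are applied only when rows are read; a value that does not fit its declared type comes back as `None`, similar to how Hive returns NULL for a field it cannot parse.

```python
import csv
import io

# Hypothetical raw file contents, stored as-is with no validation at load time.
RAW_DATA = "1,alice,30\n2,bob,not_a_number\n,carol,25\n"

# Hypothetical schema, applied only when the data is read.
SCHEMA = [("id", int), ("name", str), ("age", int)]


def cast_or_none(value, caster):
    """Cast a raw string to the declared type; return None if it does not fit."""
    try:
        return caster(value)
    except ValueError:
        return None


def read_with_schema(raw_text, schema):
    """Parse raw delimited text and project each field through the schema."""
    rows = []
    for fields in csv.reader(io.StringIO(raw_text)):
        rows.append({
            name: cast_or_none(value, caster)
            for (name, caster), value in zip(schema, fields)
        })
    return rows


rows = read_with_schema(RAW_DATA, SCHEMA)
for row in rows:
    print(row)
```

No row is rejected here: the bad `age` and the missing `id` only show up as `None` at query time, which is the trade-off behind "no constraints enforced".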

RDBMS
++++++
— Small datasets
— Serial computation
— Low latency
— Read/write operations
— SQL
— Schema on write; NOT NULL and UNIQUE constraints enforced
— Indexes allowed
— Row-level operations allowed in general
— Basic built-in functions
— No restriction on joins
— Whole range of subqueries
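
The "schema on write" and row-level points can be sketched with Python's built-in `sqlite3` module, used here only as a small stand-in for an RDBMS. The `users` table and its rows are hypothetical. The database rejects rows that violate NOT NULL or UNIQUE at insert time, and a single row can be updated in place.

```python
import sqlite3

# In-memory SQLite database standing in for a general RDBMS.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE users (id INTEGER PRIMARY KEY, email TEXT NOT NULL UNIQUE)"
)
conn.execute("INSERT INTO users (id, email) VALUES (1, 'a@example.com')")

# Schema on write: invalid rows are rejected when they are inserted.
rejected = []
bad_rows = [
    (2, None),              # violates NOT NULL
    (3, "a@example.com"),   # violates UNIQUE
]
for row in bad_rows:
    try:
        conn.execute("INSERT INTO users (id, email) VALUES (?, ?)", row)
    except sqlite3.IntegrityError as exc:
        rejected.append((row, str(exc)))

# Row-level operation: update one existing row in place.
conn.execute("UPDATE users SET email = 'b@example.com' WHERE id = 1")

stored = conn.execute("SELECT id, email FROM users").fetchall()
print("rejected:", rejected)
print("stored:", stored)
conn.close()
```

Both bad rows fail before they reach the table, so only the one valid (and later updated) row is stored.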

--

SIRIGIRI HARI KRISHNA

Written by SIRIGIRI HARI KRISHNA

Data Engineer passionate about Spark, Azure, and the Cloud. Simplifying data complexities on my Medium blog. Let's dive into the world of data together!
