What is the problem with hive indexing when dealing with partitioned tables?

index
tables

asked 2023-06-20 11:47:39 +0000

nofretete
31 ●3 ●5

edit retag flag offensive close merge delete

add a comment

1 Answer

Sort by » oldest newest most voted

answered 2023-06-20 11:50:01 +0000

djk
21 ●1 ●1

The problem with hive indexing when dealing with partitioned tables is that the index is created on the entire table instead of just one partition, which can be inefficient and slow down queries. This is because hive indexes need to be rebuilt every time a new partition is added or an existing partition is modified. Additionally, the size of the index becomes too large to handle efficiently as the number of partitions increases. To overcome this problem, it is recommended to create partition-specific indexes or use an external indexing system like Apache Solr or Elasticsearch.

edit flag offensive delete link

add a comment

Your Answer

Please start posting anonymously - your entry will be published after you log in or create a new account. This space is reserved only for answers. If you would like to engage in a discussion, please instead post a comment under the question or an answer that you would like to discuss

Add Answer

Question Tools

Stats

Asked: 2023-06-20 11:47:39 +0000

Seen: 8 times

Last updated: Jun 20 '23

What is the problem with hive indexing when dealing with partitioned tables? edit

1 Answer