Skip to content

Added Region caching#306

Open
Kamil-Krynicki wants to merge 4 commits intohortonworks-spark:masterfrom
Kamil-Krynicki:master
Open

Added Region caching#306
Kamil-Krynicki wants to merge 4 commits intohortonworks-spark:masterfrom
Kamil-Krynicki:master

Conversation

@Kamil-Krynicki
Copy link
Copy Markdown

What changes were proposed in this pull request?

My team and I have been testing the shc connector with high and very high throughput and we realized that its performance dropped significantly around 400 k reads per second and, prior to these changes, we could never go above 500 k reads per second in a stable manner.

We managed to pinpoint and patch the problem. It was related to repeated region queries that saturated the cluster.

We have exposed our changes via parameters, which are disabled by default.

How was this patch tested?

This patch was tested manually. It has also been used extensively in our tests in the CERN's NxCals project for the past 3 weeks. We have deemed it stable and efficient enough to move it to our production environment.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant