What's next for TiKV Coprocessor?

skyzh · July 12, 2021, 2:55pm

After years of development, the TiKV coprocessor has become more and more powerful, while inevitably, the codebase has become even bigger.

In the past year, we have been working on various parts of the coprocessor framework, including improving the memory layout, adding new evaluation types, and adding raw coprocessor. From my perspective, I believe there are a lot of things to do:

More efficient data encoding. Directly copy encoded chunk format data from memory to protobuf fields. There were some attempts, but failed due to some unknown bugs.
More efficient memory layout. When refactoring the framework to enable the enum pushdown earlier this year, we found ourselves in a dilemma: to use a less efficient memory layout for fewer engineering efforts, or to use a very compact memory layout that requires a large refactor. After some discussions we chose the former way. There is room for improvement.
More use cases. For raw coprocessor (or so-called coprocessor v2 or coprocessor plugins), there is still no real-world usage of the new framework. We may add some example plugins.
Easier to maintain. There are a lot of unsafe transmute of lifetimes in current query executors. GAT (generic associated types) could probably save us from such a mess. We may look into possibilities to use GAT in our codebase later when this language feature becomes stable.

What’s your opinion on the future of the TiKV coprocessor?

leiysky · July 13, 2021, 4:59am

What about supporting TiDB MPP?

ichn-hu · July 16, 2021, 6:52am

Are there any dev guide for copr v2? @skyzh

skyzh · July 16, 2021, 10:46am

Most of the guide is documented in the doc string of the framework. And we provide an example plug-in. Maybe we need to write another dev guide later.

tison · July 20, 2021, 10:45am

Hi @ichn-hu , I’m drafting a proposal for TiKV Development Guide and hopefully post the first version in this week.

It will layout as TiDB Development Guide since the Get Started, Contribute, and Understanding parts are different. Topic about copr v2 should be included in Understanding TiKV IMO. What do you think @skyzh ?

skyzh · July 20, 2021, 12:38pm

Sounds good! I could help write this part.