Apache Spark

From Celeste@Hoppinglife

Code generation

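A description of whole-stage code generation (whole-stage execution): Spark SQL fuses the operators of a query stage into a single generated Java function instead of evaluating each operator separately.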
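To make the idea concrete, here is a minimal sketch in plain Python, not Spark's actual implementation (Spark emits Java source and compiles it at runtime). It contrasts an iterator-per-operator plan with a single generated loop. The names `volcano_sum`, `generate_fused_source` and `compile_fused` are hypothetical, chosen only for this illustration.

```python
from typing import Callable, Iterable, Iterator


# Iterator-per-operator style: every operator pulls rows from its child.
def scan(rows: Iterable[int]) -> Iterator[int]:
    for row in rows:
        yield row


def filter_op(child: Iterator[int], pred: Callable[[int], bool]) -> Iterator[int]:
    for row in child:
        if pred(row):
            yield row


def project_op(child: Iterator[int], fn: Callable[[int], int]) -> Iterator[int]:
    for row in child:
        yield fn(row)


def volcano_sum(rows: Iterable[int]) -> int:
    # scan -> filter(even) -> project(x * 10) -> sum
    plan = project_op(filter_op(scan(rows), lambda x: x % 2 == 0), lambda x: x * 10)
    return sum(plan)


# Fused style: generate the source of one loop for the whole pipeline,
# then compile it once and reuse the resulting function.
def generate_fused_source(predicate: str, projection: str) -> str:
    return (
        "def fused(rows):\n"
        "    total = 0\n"
        "    for x in rows:\n"
        f"        if {predicate}:\n"
        f"            total += {projection}\n"
        "    return total\n"
    )


def compile_fused(predicate: str, projection: str) -> Callable[[Iterable[int]], int]:
    namespace: dict = {}
    exec(generate_fused_source(predicate, projection), namespace)
    return namespace["fused"]
```

Both `volcano_sum(range(10))` and `compile_fused("x % 2 == 0", "x * 10")(range(10))` compute the same sum; the fused version does it in one loop with no per-row calls between operators. In PySpark 3.0 or later, the code generated for a real query can be inspected with `df.explain(mode="codegen")`.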

Interesting papers

The idea is to find common subexpressions by logging queries, then selectively materialize those subqueries to improve performance. This is similar to CloudViews.
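The selection step can be sketched roughly as follows. This is a toy illustration under stated assumptions, not the method of any particular paper: query plans are modeled as nested tuples `(operator, child_or_argument, ...)`, the query log is a list of such plans, and a subexpression becomes a materialization candidate once it appears in at least `min_queries` logged queries. The function names and the frequency threshold are hypothetical; a real system would also weigh computation cost and storage.

```python
from collections import Counter
from typing import Iterator, List, Tuple

Plan = Tuple  # ("operator", child_plan_or_argument, ...)


def subexpressions(plan: Plan) -> Iterator[Plan]:
    """Yield the plan itself and every nested sub-plan."""
    yield plan
    for child in plan[1:]:
        if isinstance(child, tuple):
            yield from subexpressions(child)


def materialization_candidates(log: List[Plan], min_queries: int = 2) -> List[Plan]:
    """Return non-trivial subexpressions shared by at least min_queries queries."""
    counts: Counter = Counter()
    for plan in log:
        # set() so a subexpression repeated inside one query is counted once
        for sub in set(subexpressions(plan)):
            if sub[0] != "scan":  # a bare table scan is not worth materializing
                counts[sub] += 1
    return [sub for sub, n in counts.most_common() if n >= min_queries]


# A tiny hypothetical query log.
query_log = [
    ("agg", ("filter", ("scan", "orders"), "year = 2020"), "sum(price)"),
    ("join", ("filter", ("scan", "orders"), "year = 2020"), ("scan", "customers")),
    ("filter", ("scan", "customers"), "country = 'DE'"),
]
```

With this log, `materialization_candidates(query_log)` picks out only the filter over `orders`, since it is the one non-trivial subexpression shared by two queries.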

References

[Mastering Spark SQL]