tree ed5d5c1a50ab2b790e14dea638767094939d037a
parent 19097f272fe3227c71c86338bb8bf788e87cd4aa
author manishgupta88 <tomanishgupta18@gmail.com> 1539007734 +0530
committer ravipesala <ravi.pesala@gmail.com> 1539072531 +0530

[CARBONDATA-2990] Queries slow down after some time due to broadcast issue

Problem
During consecutive query runs, queries start to slow down after some time, which degrades query performance.
No exception is thrown in the driver or executor logs, but the logs show that the time taken to broadcast the Hadoop conf increases after every query run.

Analysis

This happens because CarbonData overrides Spark's SerializableConfiguration class. Spark registers its own class with the Kryo serializer, so computation using Kryo is fast. CarbonData does not get the same benefit because it overrides the class.
Spark's internal SizeEstimator calculates the size of the object, and the overridden CarbonData class holds a few extra objects, which increases the computation time.

Solution
Use the Spark class instead of overriding it in CarbonData.

This closes #2803
