-
Notifications
You must be signed in to change notification settings - Fork 28.7k
[SPARK-8148] Do not use FloatType in partition column inference. #6692
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
cc @liancheng |
The other thing we should consider, although I'm less sure about, is whether we should skip IntegerType and go straight to LongType. |
Test build #34396 has finished for PR 6692 at commit
|
Test build #34398 has finished for PR 6692 at commit
|
I'm worrying about skipping |
If it is persisted in the metastore, then inference is no longer used, isn't it? And why would partition columns be stored in Parquet file metadata? |
Had offline discussion with @rxin. There can be rare corner cases where compatibility issues may arise. However, stop using |
Test build #34419 has finished for PR 6692 at commit
|
Use DoubleType instead to be more stable and robust.
Test build #34457 has finished for PR 6692 at commit
|
Test build #891 timed out for PR 6692 at commit |
Use DoubleType instead to be more stable and robust. Author: Reynold Xin <[email protected]> Closes apache#6692 from rxin/SPARK-8148 and squashes the following commits: 6742ecc [Reynold Xin] [SPARK-8148] Do not use FloatType in partition column inference.
Use DoubleType instead to be more stable and robust.