-
Notifications
You must be signed in to change notification settings - Fork 962
[REVIEW] Match DataFrame.set_index
with pandas
#6231
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Please update the changelog in order to start CI tests. View the gpuCI docs here. |
TODO: - verify_integrity - write tests - input dataframe should include `Index` and `MultiIndex` - separate tests for inplace and verify_integrity PR 6231
TODO: - write tests - input dataframe should include `Index` and `MultiIndex` - separate tests for inplace and verify_integrity - print duplicated keys PR 6231
TODO: - print duplicated keys - support for heterogenous index lists PR 6231
Codecov Report
@@ Coverage Diff @@
## branch-0.16 #6231 +/- ##
===============================================
+ Coverage 84.45% 84.73% +0.27%
===============================================
Files 82 82
Lines 13846 14219 +373
===============================================
+ Hits 11694 12048 +354
- Misses 2152 2171 +19
Continue to review full report at Codecov.
|
DataFrame.set_index
with pandasDataFrame.set_index
with pandas
- Remove empty line to prevent empty code block - Remove print statement - Parameterize `inplace` Co-authored-by: GALI PREM SAGAR <[email protected]>
Addressing review comment PR 6231
Where's the problem with CI? Can't figure out the error message. |
Probably one off issue. Rerun tests might fix it |
rerun tests |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Couple of minor changes
- ._set_index default to None - Use MultiIndex.from_frame instead of `source_data` PR 6231
@galipremsagar can you review again when you get a chance? |
- Code document updates - Code style improvements - Explicitly raising TypeError when input cannot be converted - Throwing KeyError instead of ValueError when column is not found Co-authored-by: GALI PREM SAGAR <[email protected]>
- Minor style and doc update. Co-authored-by: GALI PREM SAGAR <[email protected]>
rerun tests |
@galipremsagar is this good to go once CI reports green? This is blocking a nice dask-cudf groupby optimization 😄 |
Yep, good to go |
When using RangeIndex, it should not be instantiated unless needed. PR 6231
rerun tests |
Closes #6198
Implements:
inplace
,append
,verify_integrity
fields to the method