-
Notifications
You must be signed in to change notification settings - Fork 325
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement DataFrame nunique #1137
Conversation
f11595f
to
dc496be
Compare
Does this PR solve #1124 or not? At a glance, docs are absent. |
Codecov Report
@@ Coverage Diff @@
## master #1137 +/- ##
=========================================
Coverage ? 92.97%
=========================================
Files ? 642
Lines ? 49944
Branches ? 7416
=========================================
Hits ? 46437
Misses ? 2309
Partials ? 1198
Continue to review full report at Codecov.
|
627b44a
to
93e7c70
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM overall, some comments are left.
bd3e22c
to
4432777
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
(cherry picked from commit b9f8434)
(cherry picked from commit b9f8434)
What do these changes do?
This PR implements df.unique, it's now a simple way that record all unique values and count them at the aggregation stage, it should be optimized if the amount of unique values are too large.
Related issue number
Resolves #1124 .