Comments (2)
The bug was introduced in #648
@jeremiedbb apparently packaging
only comes pre-installed with pip when using the conda distribution, otherwise it must be installed explicitly.
In this regard #648 seems like a regression should it be reverted, or still resorting to the package_resources
way of doing if packaging
is not found at least ?
package_resources
has the disadvantage of outputting another warning because direct usage is discouraged, but in this particular case outside of conda distribution it's still the only native package with what is needed to check the dependency tree at runtime it seems.
The possible outcomes I see there would be either
- add
packaging
as a dependency of skrub - fallback on the
package_resources
solution if packaging is not importable - revert #648
from skrub.
Thank you @baggiponte and @fcharras for the investigation! Closing this as #712 fixes the issue.
from skrub.
Related Issues (20)
- support python 3.8 & 3.9 HOT 2
- Follow-up after #742 InterpolationJoin
- Test polars support HOT 1
- Drop numpy array input support for `TableVectorizer`
- `get_feature_names_out` returns lists instead of numpy arrays HOT 2
- datetimeencoder is very slow HOT 3
- TableVectorizer imputing logic is confusing HOT 5
- 2 ways GapEncoder get_feature_name_out is broken on low entropy data
- cannot run test suite in python3.12 due to warnings filter
- Add "to_datetime" to the narrative documentation
- development status in setup.cfg HOT 1
- Add a "related projects" section in the documentation HOT 1
- Adding a TableVectorizer specialization for HistGradientBoosting HOT 2
- allowing to use a different distance for the nearest neighbors in fuzzy join HOT 1
- Consider casting to float32 by default in TableVectorizer HOT 3
- Handle numerical missing values in TableVectorizer HOT 8
- Basic regression problem raises exception on inference HOT 4
- TableVectoriser's "numerical_transformer" does not accept Pipelines HOT 3
- fetch_ken_types gives same results for many embedding_table_id's HOT 2
- Potential performance issue: .to_dict method slow in pandas below 2.2 HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from skrub.