Comments (5)
@Nestak2 You have to set pandas.set_option('display.html.table_schema', True)
so that it outputs the JSON, like in this examples.
from jupyterlab-data-explorer.
@saulshanabrook I have a question, that looks related to your post - When using jupyterlab-data-explorer, how can I convert a pandas dataframe to json in the notebook, so that I can use the different graphic options of nteract's data-explorer?
As an example, see here the example jupyterlab-data-explorer notebook, where you have the different graphical options in the red rectangle.
On the otherhand, this graphical options are not available for plain dataframes (example in my personal notebook). How can I make them available?
from jupyterlab-data-explorer.
@saulshanabrook Thank you very much, I didn't know the purpose of this line, now the graphical features are there!
from jupyterlab-data-explorer.
Great! Glad it's working for you. I have added this to the usage docs to hopefully make it more clear in the future: #135
from jupyterlab-data-explorer.
FWIW,
- tablib https://tablib.readthedocs.io/en/stable/ supports a bunch of formats: 'cli, csv, dbf, df (DataFrame), html, jira, json, latex, ods, rst, tsv, xls, xlsx, yaml'
- Tabulate does [HTML, LaTeX, *] tables from lists of lists, lists of dicts, etc. (without pandas)
https://github.com/astanin/python-tabulate . Hoping for these to land in a release soon:- Jupyter support: astanin/python-tabulate#27
- astanin/python-tabulate#26
- odo
https://github.com/blaze/odo (2018) does conversion between very many formats:Odo migrates data using network of small data conversion functions between type pairs. That network is below:
odo conversionsEach node is a container type (like pandas.DataFrame or sqlalchemy.Table) and each directed edge is a function that transforms or appends one container into or onto another. We annotate these functions/edges with relative costs.
This network approach allows odo to select the shortest path between any two types (thank you networkx). For performance reasons these functions often leverage non-Pythonic systems like NumPy arrays or native CSV->SQL loading functions. Odo is not dependent on only Python iterators.
- Ibis
https://docs.ibis-project.org/backends.html- Impala, BigQuery, HDFS, Spark, SQLAlchemy, Pandas
- blazingsql
https://github.com/BlazingDB/blazingsql is really fast. It reads into the GPU from CSV, TSV, JSON, Apache Parquet, Apache ORC, Apache Hive, GDF (GPU Dataframe), S3, GCS, Apache HDFS:
https://docs.blazingdb.com/docsBlazingSQL is a GPU accelerated SQL engine built on top of the RAPIDS ecosystem. RAPIDS is based on the Apache Arrow columnar memory format, and cuDF is a GPU DataFrame library for loading, joining, aggregating, filtering, and otherwise manipulating data.
BlazingSQL is a SQL interface for cuDF, with various features to support large scale data science workflows and enterprise datasets.
- notes re: "A dataframe protocol for the PyData ecosystem"
https://discuss.ossdata.org/t/a-dataframe-protocol-for-the-pydata-ecosystem/267/9
CSVW would be ideal for tabular data (with Linked Data metadata about the dataset and each column). More about this here: "Linked Data formats, tools, challenges, opportunities; CSVW, schema.org/Dataset, schema.org/ScholarlyArticle" https://discuss.ossdata.org/t/linked-data-formats-tools-challenges-opportunities-csvw-schema-org-dataset-schema-org-scholarlyarticle/160
from jupyterlab-data-explorer.
Related Issues (20)
- [UX Review] Right panel
- [UX Review] Typography of "Datasets" heading
- [UX Review] Add ability to remove dataset from registry in UI
- What MIME types should we support in this repo?
- [UX Review] Multiple ways of tracking active/selected dataset
- Publish Python Package for exposing datasets
- Paths with spaces not working HOT 1
- Installing the plugin failed HOT 1
- Error: No provider for: @jupyterlab/dataregistry:Registry. HOT 3
- Support JupyterLab 2.1 HOT 5
- User stories for data registry HOT 2
- Issue building with JupyterLab 2.1.1 and tsc 3.9.3
- Error when building extension HOT 2
- Support JupyterLab 3.0 HOT 2
- stable version of demo fails to build on OVH cloud HOT 1
- Simplify data model by removing nested dataset capability
- Core dataset API
- Use JupyterLab tokens to strongly type dataset interfaces HOT 2
- Not support current version HOT 1
- Archiving this repository HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from jupyterlab-data-explorer.