commit | 9119091af78b947de9ba23f23fccc43c8019adb8 | [log] [tgz] |
---|---|---|
author | Maxime Beauchemin <maximebeauchemin@gmail.com> | Wed Jun 27 21:35:12 2018 -0700 |
committer | Maxime Beauchemin <maximebeauchemin@gmail.com> | Wed Jun 27 21:40:14 2018 -0700 |
tree | b8a1ddb10a681be5806a6e271b661f82013e9902 | |
parent | af74c1b8bb845157ce446bd0ae9a49a65420c34f [diff] |
Improve database type inference (#4724) * Improve database type inference Python's DBAPI isn't super clear and homogeneous on the cursor.description specification, and this PR attempts to improve inferring the datatypes returned in the cursor. This work started around Presto's TIMESTAMP type being mishandled as string as the database driver (pyhive) returns it as a string. The work here fixes this bug and does a better job at inferring MySQL and Presto types. It also creates a new method in db_engine_specs allowing for other databases engines to implement and become more precise on type-inference as needed. * Fixing tests * Adressing comments * Using infer_objects * Removing faulty line * Addressing PrestoSpec redundant method comment * Fix rebase issue * Fix tests (cherry picked from commit 777d876a523590be0e18f573eab02137fa989fa3)
Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
[this project used to be named Caravel, and Panoramix in the past]
View Dashboards
Slice & dice your data
Query and visualize your data with SQL Lab
Visualize geospatial data with deck.gl
Choose from a wide array of visualizations
Apache Superset is a data exploration and visualization web application.
Superset provides:
Superset speaks many SQL dialects through SQLAlchemy, a Python ORM that is compatible with most common databases.
Superset can be used to visualize data out of most databases:
On top of having the ability to query your relational databases, Superset ships with deep integration with Druid (a real time distributed column-store). When querying Druid, Superset can query humongous amounts of data on top of real time dataset. Note that Superset does not require Druid in any way to function, it's simply another database backend that it can query.
Here's a description of Druid from the http://druid.io website:
Druid is an open-source analytics data store designed for business intelligence (OLAP) queries on event data. Druid provides low latency (real-time) data ingestion, flexible data exploration, and fast data aggregation. Existing Druid deployments have scaled to trillions of events and petabytes of data. Druid is best used to power analytic dashboards and applications.
Interested in contributing? Casual hacking? Check out Contributing.MD
Here's a list of organizations who have taken the time to send a PR to let the world know they are using Superset. Join our growing community!