by dyml on 2/13/22, 4:52 PM with 133 comments
Preferably I’d just like to click on a MariaDB table and receive some plots and statistics on the columns.
Whats your experience on this?
Edit: to clarify, I don’t want to visualize the database itself (Schema’s, keys etc). Just the data within it.
by btown on 2/13/22, 7:23 PM
"Connect to an arbitrary database, create a view that joins numerous tables (including foreign tables, via blending) together, load to columnar storage on a local SSD for performance if necessary, add arbitrary derived columns (including well-defined lateral lookups for things like 'annotate this action with the date of the first action of this action's user' [0]), group by 4 of the derived columns, map two of the groupings to nested dimensions along the horizontal axis and two to the vertical axis, and show the sum or count at each cell in a resulting table, then when satisfied, drill down into a slice and turn it into a bar chart with colors that match your branding needs" - every one of those clauses can be accomplished with drag-and-drop mouse commands almost at the speed of thought.
And once you get the hang of it, there's zero impedance mismatch with hand-rolled SQL, it's just way faster to iterate on, especially with schemas where you may not remember all the columns available to you, and especially when you're doing so over screenshare with non-technical colleagues.
[0] https://help.tableau.com/current/pro/desktop/en-us/calculati...
by vijucat on 2/14/22, 2:32 AM
Use Excel, QlikSense and Tableau if Business Users need visualization. Excel pivot tables = OG data reshaping. Resist more complicated solutions : do you really need more than Excel?
QlikSense doesn't get enough love. It's actually better than Tableau at some scenarios. Or PowerBI if you're a Microsoft shop. Last time I checked, Power BI's Q&A [2] was a KILLER FEATURE. "Show me Sales per Region, Quarterly" and then you get to fine tune it. R and Shiny dashboards = last resort; too much bespoke work. 2 months using R + Shiny can be 1 week in Tableau / QlikSense / PowerBI.
1. "Efficient reshaping using data.tables" https://cran.r-project.org/web/packages/data.table/vignettes...
2. https://docs.microsoft.com/en-us/power-bi/natural-language/q...
by emilsedgh on 2/13/22, 8:13 PM
by justindeguzman on 2/13/22, 11:31 PM
We're trying to re-imagine popular SQL clients (phpMyAdmin, MySQL Workbench) to have the design and software quality of modern tools like Superhuman, Linear, etc. We make it very easy to query and create charts from your database then share it with your team.
We currently support MySQL, Postgres, PlanetScale, Yugabyte, and in the next couple of weeks SQLite and ClickHouse.
by sfifs on 2/13/22, 10:10 PM
There used to be a lot of good candidates in this space even just a few years ago but Power BI has improved it's product and integrations very rapidly and with its affordability has displaced them at many big companies. Power BI also recently added some NLP capabilities from one of Microsoft's acquisitions which makes usage by non technical users easier.
If you're willing to put your data on BigQuery, then Google Data Studio / Looker is an even better solution for larger datasets due to the seamless integration and intelligent caching which (purely in my perception) seems to work better than Azure Analytics Services in the Microsoft side. Also BigQueryML works within SQL.
Source: i lead an Analytics and Data Science team at a Fortune 50
by chrisjc on 2/14/22, 1:25 PM
Moreover, if you plan to adopt and build upon one of these tools that infers and generates the models as well as provide the explore and visualization functionality, you might be painting yourself into a corner and forcing all current and future workloads to use this layer. Otherwise you'll be having to reinterpret and reimplement your models all over the place; one off SQL scripts/reports, web analytics, dashboards/visualizations/reports on other analytics tools. Then you'll also end up having to scale this tool up in both compute and storage to handle the load that grows over time. This can end up being quite costly in time, money and responsibility.
While these tools will offer a lot of value providing visibility and insight into your data, it'll probably be worth circling back and seeing if the data can and should be modeled correctly (semantic layer) before hitching your wagon to your first choice.
Once your data is all modeled, it might be worth re-evaluating all the tools that you started with and see how they manage now that your house is a little more in order.
Remember, your modeling doesn't have to be done by the same tool that does your exploration and visualization.
This is a great article related to these ideas: https://benn.substack.com/p/is-bi-dead
by superdupershant on 2/13/22, 7:23 PM
by samwillis on 2/13/22, 7:20 PM
by datacoffee on 2/13/22, 8:59 PM
by js4ever on 2/13/22, 9:39 PM
by jmnicolas on 2/13/22, 10:56 PM
I just settled on DBeaver, but don't consider that an endorsement from my part.
I found DB Browser for SQLite to be the least bad, but it's obviously limited to SQLite.
My problem may come from the fact I have simple needs and they're all very complex apps. My SQL queries are rarely longer than 50 lines and I do DB admin tasks from the command line.
Among the unending list of apps I should code for myself there's a SQLPad project. Maybe one day.
by otoburb on 2/13/22, 8:50 PM
by Simon_O_Rourke on 2/14/22, 7:01 AM
Gephi lets me show how this kind of table bloat happens over time and helps explain performance degradation.
by dataspun on 2/14/22, 5:12 AM
PopSql - https://popsql.com
Trevor - https://trevor.io
by jraph on 2/13/22, 7:46 PM
For SQLite databases, I use sqlitebrowser.
Both tools are open source.
by chasil on 2/13/22, 8:33 PM
This is very much a Java application, and appears to allow several JDBC drivers for 3rd party databases.
It's free, and is designed to compete with (or drag underwater) Quest Software's Toad.
https://www.oracle.com/database/technologies/appdev/sqldevel...
3rd party drivers:
https://www.oracle.com/database/technologies/appdev/sqldev/t...
by mongrelion on 2/13/22, 6:36 PM
by ttz on 2/15/22, 1:50 AM
Metabase or Apache Superset, as others have mentioned, can be deployed on-prem so it's a bit more isolated/secure.
by d--b on 2/13/22, 8:08 PM
It's based on Observable (https://www.observablehq.com), which has a nice Summary table feature, sounds like what you need (https://observablehq.com/@observablehq/summary-table)
by fredrik_skne_se on 2/13/22, 7:20 PM
by go_prodev on 2/13/22, 7:55 PM
Paid: If you have the budget, Aqua Data Studio gives you the database management functionality AND all of the visualizations you'll find in Tableau in the 1 product.
(My company shifted from Tableau to Power BI. At first it seemed like a beta product with lower fidelity. But Microsoft has made the whole power suite into a force to be reckoned with... highly recommended)
by bessarabov on 2/13/22, 6:55 PM
by sdesol on 2/13/22, 11:16 PM
Looking at recently merged pull requests that were less than 120 days old, apache/superset had 89 unique authors, which is very high, as the following shows:
https://oss.gitsense.com/insights/github?q=pull-age%3A%3C%3D...
Metabase had 37 authors, which is also quite high for an opensource project, which the following shows:
https://oss.gitsense.com/insights/github?q=pull-age%3A%3C%3D...
And redash has 11, but most of the contributions were more than 28 days ago, and is significantly less active than metabase and superset as the following shows:
https://oss.gitsense.com/insights/github?q=pull-age%3A%3C%3D...
I was actually quite surprised by superset, as I never heard of them before, but they are backed by serious investment (https://preset.io/about/), which clearly shows in how active their repository is.
by chris_hoyle on 2/13/22, 7:00 PM
by eatonphil on 2/13/22, 7:42 PM
See https://www.kaggle.com/rhuebner/human-resources-data-set. I think it's a great view on top of a datatable.
by telchar on 2/13/22, 10:59 PM
by text2db on 2/14/22, 11:38 AM
by abotsis on 2/13/22, 7:55 PM
by _wei_ on 2/13/22, 10:18 PM
It works with SQL databases and CSV file. Since it's an offline desktop app, it's free to use and you don't need to set up a server or upload data.
by petilon on 2/13/22, 8:27 PM
by kinj28 on 2/16/22, 9:49 AM
Give it a go at https://www.dronahq.com
by MaknMoreGtnLess on 2/13/22, 6:16 PM
or its commercialized offer: Preset (Cloud)
by johnhess on 2/13/22, 7:36 PM
The most effective, efficient data exploration tool I've ever used. I'm a data scientist, but I use this before I write so much as a line of code.
by ioseph on 2/14/22, 3:59 AM
I've tried google data studio, superset, but what I need is a integrated control where user's can filter the report based on their location. Or alternatively which would make it easiest for me to develop this control myself.
by adrianthedev on 2/13/22, 8:33 PM
Credentials to admin panel in one click.
by nicodjimenez on 2/13/22, 8:12 PM
by FujiApple on 2/13/22, 11:24 PM
https://github.com/thenaturalist/awesome-business-intelligen...
From this I picked Metabase and found it to be pretty good.
by RyanHamilton on 2/13/22, 8:57 PM
by feupan on 2/14/22, 3:43 AM
by tehlike on 2/13/22, 6:26 PM
by arey_abhishek on 2/16/22, 8:35 AM
by a1445c8b on 2/14/22, 1:58 AM
by interlocutor on 2/13/22, 8:16 PM
by caoxhua on 2/15/22, 8:07 PM
by sawaali on 2/13/22, 8:36 PM
by zobione on 2/13/22, 7:43 PM
by muhehe on 2/13/22, 7:37 PM
by rekoros on 2/14/22, 2:36 AM
by tbrock on 2/13/22, 11:00 PM
by sc4les on 2/14/22, 12:11 AM
by pnathan on 2/14/22, 5:48 AM
by hoofhearted on 2/14/22, 12:35 AM
by exotree on 2/13/22, 10:24 PM
by gavinray on 2/13/22, 8:33 PM