Add a "full series" option to the stack transforms #351

Fil · 2021-05-01T16:47:11Z

See discussion at #325 and #348 (comment)

… by number of cylinders The bins are sorted by decreasing r, so that they are all visible. The example would benefit from stackR (#197). It could also benefit from a strategy to create missing values for the line, so that it's broken when there are no data. However, it won't work with an approach such as "return empty bins" (#495), because returning empty bins will not create the *z* values for each and every category, which would be necessary if we wanted to create broken lines. This shows that a generic foolproof solution to #351 will require much more than #495 (and #489 and #491 are not better in that regard).

* This example plot computes the median of cars' economy (mpg), grouped by number of cylinders The bins are sorted by decreasing r, so that they are all visible. The example would benefit from stackR (#197). It could also benefit from a strategy to create missing values for the line, so that it's broken when there are no data. However, it won't work with an approach such as "return empty bins" (#495), because returning empty bins will not create the *z* values for each and every category, which would be necessary if we wanted to create broken lines. This shows that a generic foolproof solution to #351 will require much more than #495 (and #489 and #491 are not better in that regard). * Update test/plots/cars-mpg.js Co-authored-by: Mike Bostock <[email protected]> * Update test/plots/cars-mpg.js Co-authored-by: Mike Bostock <[email protected]> * zero, not filter * group, not bin * remove console.log * stroke, not fill Co-authored-by: Mike Bostock <[email protected]>

mbostock · 2021-08-12T16:08:09Z

Datadog calls this “default zero” interpolation: https://docs.datadoghq.com/dashboards/functions/interpolation/#default-zero

I wonder to what degree this is specific to time series. I can certainly imagine cases where it’s not specific to time series, but when it is, it seems like the bin transform with filter: null is an option for fixing the missing data. Edit: Okay, the example histogram you made is pretty convincing that we shouldn’t think of this as only a time-series problem. (Also in a related irony, this Cloud Costs notebook demonstrates the problem, but has another problem of time being represented as ordinal strings.)

Fil · 2021-08-12T19:48:40Z

It's worse than this: using the empty bin approach is necessary for a continuous ("binnable") domain, but far from sufficient—as soon as you have z or facets, you need a point (real or fake) for each element of the domain times each of the series.

mbostock · 2021-08-12T19:56:35Z

But the bin domain is the same across all groups and facets, so as long as you have at least one data point in a given group, you’ll get all the bins?

Fil · 2021-08-12T20:46:42Z

In https://observablehq.com/d/f6a7975f2ad4519a there is just one empty bin (4,750 in in the chinstrap facet), which should be mapped to 0 for data fidelity. If we push up the number of bins to 200, we start to see that issue creeping everywhere — outlined in red in the image below, all the areas should drop to zero since there is no data point in this position.

(I'm not sure that we can find a generic way to do both operations, maybe imputing missing values is something that should be left to the data-wrangling section?)

mbostock · 2021-08-12T21:04:35Z

But that example doesn’t use filter: null on the bin transform, right?

Fil · 2021-08-13T16:16:03Z

the Plot.seriesX transform is possibly superseded by #499 and #500

mbostock · 2022-02-26T20:28:12Z

I think this is probably now a duplicate of #597. (And perhaps #513 in the case of ordinal data.) At any rate they’re all closely related.

Fil · 2022-03-01T14:37:42Z

closing since this example is solved with filter: null.

Fil self-assigned this May 1, 2021

Fil mentioned this issue May 1, 2021

Should the bin transform return empty bins? #325

Closed

Fil added the enhancement New feature or request label May 3, 2021

This was referenced Aug 11, 2021

Cars MPG example plot #496

Merged

sort and filter for bin and group #495

Merged

mbostock mentioned this issue Feb 28, 2022

dense interval for line and area #792

Merged

Fil closed this as completed Mar 1, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a "full series" option to the stack transforms #351

Add a "full series" option to the stack transforms #351

Fil commented May 1, 2021

mbostock commented Aug 12, 2021 •

edited

Loading

Fil commented Aug 12, 2021

mbostock commented Aug 12, 2021

Fil commented Aug 12, 2021 •

edited

Loading

mbostock commented Aug 12, 2021

Fil commented Aug 13, 2021 •

edited

Loading

mbostock commented Feb 26, 2022

Fil commented Mar 1, 2022

Add a "full series" option to the stack transforms #351

Add a "full series" option to the stack transforms #351

Comments

Fil commented May 1, 2021

mbostock commented Aug 12, 2021 • edited Loading

Fil commented Aug 12, 2021

mbostock commented Aug 12, 2021

Fil commented Aug 12, 2021 • edited Loading

mbostock commented Aug 12, 2021

Fil commented Aug 13, 2021 • edited Loading

mbostock commented Feb 26, 2022

Fil commented Mar 1, 2022

mbostock commented Aug 12, 2021 •

edited

Loading

Fil commented Aug 12, 2021 •

edited

Loading

Fil commented Aug 13, 2021 •

edited

Loading