Filter with YouPrep™¶
Overview¶
When you are building your story, you might want to filter your data to display only a part of them. Previously, you did it two ways: the query in mongo and with postprocess. Thanks to all the latest releases, you are able to filter your data without any pain thanks to the YouPrep™. You can apply the following operations to your dataset:
- Delete columns
- Keep columns
- Filter Rows
- Top N rows
- Argmax
- Argmin
Examples¶
Warning
Before you start!
I’m working on a datasource containing the GDP and population for all countries between 2012 and 2016. I want to display only the evolution of the GDP for the US and UK between 2012 and 2016. To obtain my filtered dataset, I’m going to realize different step in YouPrep™.
Go in the Query and click on “filter”. To keep only the column “Time”, “Country Name” and “GDP” you can use the delete columns or keep columns operation.
📝 Keep only the columns you need to make it crystal clear.
Your YouPrep step should look like this:
Then you need to apply the Filter Rows operation. You need to select the column, the operation and the value on which you want to apply your operation. In my example, I want to keep all the rows where the country name is “United States” OR “United Kingdom”
📝 Do not hesitate to add all your filtering operations into one step.
Your YouPrep step should look like this:
NB: You have different ways to perform this filtering operation:
👉 Using the “Is one of” filtering operation: You just need to write the name on which you want to apply your filtering operation and then press the enter button. You can add as much wording as you want.
👉 Or, using a “regex” operation:
Important
A regular expression is a sequence of characters that define a search pattern.
The syntax of the Regex expression in Toucan allow you to apply all the
regex operations (matches a single character, except characters,
characters in a range, any digit, any non-digit…) In my example i want
to keep United States and United Kingdom (and not United Arab Emirates)
so I need to add a regex expression. United [KS]
will allow me to
keep all the rows containing United
and matching a single character
in the list KS
(case sentitive).
Note
You could test your regex syntax here 👉 Go in the RegEx reference menu for all the regex syntax Or here 👉 Find all the regex syntax in the bottom right-hand corner into the Quick references block
You can also apply different operations to your dataset: Top N rows will allow you to keep a TOP N of the rows ordering them with the column of your choice. Below you can find a TOP N rows operations with a group by Country Name (if you need to aggregate your data before ordering it and keeping the top values):
📝 Top N rows could be really helpful to display a fix number of rows. It’s the perfect step to use for tile
Your YouPrep step should look like this :
You’re all set, your dataset is looking like this:
Argmin/Argmax: This operation will display you the smallest or the biggest value of your column (you can also make the choice to group by another column).
📝 The perfect tool for your HKPI!!!
Do not hesitate to read the weaverbird documentation if you need any further informations on the YouPrep!