Calculating Percentage good value

Question

Question from an end user:What is the best way to calculate the percentage of good values for a time series over a year using Cognite Charts? I have a list of PSVs in various units that I need to see if we have data availability of at least 95% in 2022.

Eric Stein-Beldring · Accepted Answer

@stanleychiuthe same approach should still work, except instead inputing the same value (1in my example above)in the Lower limit and Upper limit parameters of the Threshold function, you’ll simply input the range that represents the “good” values.Therefore, whenever the sensor is within the bounds of therange you specify (i.e. Is of a “good” value), the calculation will output a value of1, and0 otherwise. From here, the rest of the calculation will work. Although, again, this is assuming your pressure time seriesis uniformly sampled (no gaps, no variation in sampling frequency).When it comes to scaling this to more than 50 time series, this isn’t something that can be entirely done via the UI today and will require the assistance of a data scientist to solve. This can be done by leveraging ourPython SDK, Cognite Functions,and InDSLto recreate the calculation – plus the flexibility of working directly in python will allow one to make the calculation more robust (e.g. Different strategies for fillinggaps in the time series). Although, this is certainly a workflow we intend to support entirely from the Cognite Data Fusion UI (no coding necessary) – @Knut Vidveiwould be the best person to connect with to discuss and to share more information.In the meantime, you will need to “Duplicate” each calculation (via the …button in the More column) and/or changingthe input time series accordingly. It will take some manual work to get it set up initially, but can be used anytime thereafter by changing the date range in view.

Eric Stein-Beldring · Answer

Hi @rsiddhaand @stanleychiu,I believe I have a calculation workflow that will work for this use case:In the case above, I’m using a uniformly sampled time series (always 1 data point per hour) that produces binary results (1 = on or “good”; 0 = off or “bad”). By setting the range in view to 1 year exactly, I can be confident that there are exactly8760 data points. If this isn’t the case for your time series data, then you will need to add some additional steps your calculation (e.g. a Resample to granularityfunction).The final result of this calculation tells me that, for the past year, 98.9% of the values were “good” (or = 1) for this time series from Feb. 14, 2021 - Feb. 14, 2022.Important note: If you’re looking at a time series or time range that requires> 100k data points, this will not work since the application will automatically downsample the time series input and fetch aggregates rather than individual data points (for the time being). If this is the case, one workaround is to “batch” the overall calculation over smaller time windows (when you know the total # of data points is < 100k)and move the plot window-by-window to calculate oversmaller ranges – likely stopping after the Integration function to get the total # of “good” data points in each window. This obviously requires you to make note of each result and have a final calculation to get the % result(you can create acalculation with only constants as inputsdothis in the same chart).Let us know whether approach willsolveyour use case. Looking forward to hearing your feedback!

Calculating Percentage good value

4 replies

Reply

Cookie Policy

Cookie settings

Reply

Related topics

anonymity in respondentsicon

Results recover with limited submissions planicon

Collect data for anonymous submission: Then request contact dataicon

Change URL for existing videoicon

Results page that doesnt generate a FB conversionicon

Sign up

Log in to the community

Scanning file for viruses.

This file cannot be downloaded

Cookie Policy

Cookie settings