Solved

Automatic labeling of CDF resources

  • 18 September 2022
  • 1 reply
  • 60 views

Userlevel 3
Badge

One thing that would make the finding of the correct data is to label the CDF resources better with labels. https://docs.cognite.com/dev/concepts/resource_types/labels

 

In the contextualization pipeline could things be automatically labeled such as:

Document type: P&ID, PDF(Process Flow Diagram), loop diagrams, datasheets, single line. that theses documents are either labeled from the metadata. document ID schema (regex will do it) or from classification algorithm the file, and then gives it a label to the file. that we can see from the label on the resource what it is.

The same is for Asset: Is it a Junction box, Transmitter, Skid, Pump, Motor, valve etc etc. 

Time series: is it a physical measurement, is an inference / calculation / “soft-tag”, is a parameter (like the gain or integration time on PID controller) this could be determent by simple analyzing of the data-point behavior of the time series. cause now when we search for a time series often a time series of a parameter that almost never change is often one of the first to show.

Then filtering on both the label and asset would make finding the correct data a lot easier, but the CDF projects we have in AkerBP don’t yet have this automatic labeling of the different resources. Is this something that is being looked into? 

Yes we can do it manually through the API/SDK’s, but doing it programmatically on a schedule as a part of contextualization pipeline would be amazing. in addition to that users of data can more easier label the resources that they need and use in their day to day. 

icon

Best answer by Nils Barlaug 21 September 2022, 08:29

View original

1 reply

Hi Stig,

This is a very important challenge for Cognite! We agree that good contextualized data is crucial for discoverability and searchability. Documents can actually already be automatically labeled by type by a classification model (let me know if you want more information on that), but we are currently not looking into automatically labelling assets and time series. It’s a good suggestion though, so we’ll bring it up when we plan what to do next. In the meantime we are working on several initiatives that will make finding the correct data easier. So stay tuned for future product updates 🙂

Reply