I just finished the cognite academy examples on contextualization.
I did notice on the PID contextualization example that there were quite a few errors in what seems potentially character recognition pipeline to identify tags in the PDF.
The tutorial stated that “these were all good” and “we can accept all”.
I suspect such a process would create missing or strange links in the contextualized dataset.
Are these known issues?
I lack a bit the understanding of the context for the importance of these mispredictions,
but I thought to report them anyway just in case.
Happy to support you on improving these if they are something that needs improvement.