Extracting Asset Hierarchy from PDF P&IDs in Cognite Data Fusion

Question

In a hypothetical scenario where we need to extract asset hierarchy directly from PDF-based P&ID documents, does Cognite provide a built-in library to support this? Or would we need to rely on external libraries like PyMuPDF or pdfplumber for data extraction—or even tools like Pytesseract in cases involving scanned images?

Thank you for any guidance on best practices or integrations for this use case!

Andre Alves · Answer

Thank you, @Elcio Cardoso da Silva. We’re aware of this project, but our main objective here is to determine if there’s any built-in solution within Cognite that we could leverage.Cognite Team, if you have any insights or suggestions regarding this, please share them with us.Thanks again!

Reply

Sign up

Log in to the community

Scanning file for viruses.

This file cannot be downloaded