Image Tools, Agents, and Skills in NeqSim

Purpose

NeqSim has an engineering-image workflow for turning visual technical material into structured data for simulation, design, and reporting. The workflow supports images and image-heavy documents such as P&IDs, PFDs, mechanical drawings, vendor datasheets, scanned PDFs, compressor and pump maps, phase envelopes, and screenshots.

The core flow is:

PDF or image document
-> PDF-to-PNG conversion or direct image view
-> visual analysis and/or OCR
-> structured extraction
-> NeqSim JSON, route model, notebook, or report

Core Tools

Capability	Tool	Purpose
Direct image inspection	`view_image`	Reads PNG, JPG, JPEG, GIF, and WEBP images directly.
PDF page rasterization	devtools/pdf_to_figures.py	Converts PDF pages to PNG images for visual analysis.
Scanned PDF OCR	devtools/pdf_ocr.py	Extracts text from scanned PDFs and engineering drawings.
P&ID tag OCR	devtools/pdf_ocr.py –pid	Uses high-DPI sparse OCR for equipment, instrument, and line tags.
STID document download	devtools/stid_download.py	Downloads STID documents into a task folder and can convert PDFs to PNG.
Document extraction	`neqsim-technical-document-reading`	Provides structured extraction patterns for technical documents and images.
Report figure traceability	`results.json` figure sections	Links figures to captions, discussions, and key results.

Image-Aware Agents

Agent	Best Use
`technical.reader.agent.md`	Primary agent for reading technical documents and engineering images. Extracts structured data from P&IDs, drawings, datasheets, maps, and phase envelopes.
`extract.process.agent.md`	Converts extracted PFD, P&ID, and process data into NeqSim JSON or route models.
`literature.scout.agent.md`	Retrieves papers, standards, STID documents, vendor documents, and image-heavy references for task workflows.
`solve.task.agent.md`	Runs the full task-solving workflow and consumes extracted image and document data in engineering studies.
`review.agent.md`	Reviews task deliverables, including figure-to-discussion-to-result traceability.
`notebook.example.agent.md`	Creates notebooks with executed cells, plots, saved figures, and results metadata.
`documentation.agent.md`	Turns results, figures, and workflows into documentation.

Relevant Skills

Skill	Use
`neqsim-technical-document-reading`	Main skill for extracting structured data from documents and engineering images.
`neqsim-pdf-ocr`	OCR for scanned PDFs, P&IDs, vendor datasheets, and engineering drawings.
`neqsim-stid-retriever`	Retrieval of STID, vendor, P&ID, and datasheet documents into task folders.
`neqsim-process-extraction`	Conversion of extracted process data into NeqSim JSON or route models.
`neqsim-professional-reporting`	Figure captions, figure discussions, linked results, and report traceability.
`neqsim-notebook-patterns`	Figure generation, notebook execution, and `results.json` integration.
`neqsim-plant-data`	Plant historian data workflows when image or document extraction is combined with PI/Aspen data.
`neqsim-unisim-reader`	UniSim model extraction through COM rather than image reading. Useful when the source is an actual simulator file instead of screenshots.

Supported Image and Drawing Types

Image Type	Extracted Information
P&ID	Equipment tags, valve tags, instrument tags, line numbers, piping classes, connection topology, control loops, interlocks.
PFD	Equipment sequence, stream arrows, temperatures, pressures, flow rates, utility duties, recycle loops.
Mechanical arrangement drawing	Dimensions, nozzle sizes and orientations, standpipes, drains, elevations, piping runs, access limitations.
Vendor API datasheet image	Operating conditions, design pressure and temperature, seal data, bearing data, materials, leakage rates, utilities.
Compressor or pump performance map	Rated point, speed curves, head, flow, efficiency, surge line, stonewall, operating window.
Phase envelope	Critical point, cricondentherm, cricondenbar, operating point location, two-phase region.
Scanned report or datasheet	OCR text, tables where possible, document metadata, equipment tags, requirements.
Screenshot	Visible text, bullet lists, diagrams, high-level structure, manually readable data.

Recommended Workflows

Image File

PNG/JPG image
-> view_image
-> structured extraction
-> process JSON, notes, notebook, or report

Born-Digital PDF

PDF
-> pdfplumber or pymupdf text extraction
-> pdf_to_figures.py for diagrams and charts
-> view_image for visual pages
-> structured extraction

Scanned PDF or P&ID

Scanned PDF
-> pdf_to_figures.py at 300-400 DPI
-> view_image for visual topology
-> pdf_ocr.py for searchable text and tags
-> regex post-filtering for tags
-> structured extraction

STID or Vendor Document Task

STID/vendor source
-> stid_download.py into task references folder
-> optional PDF-to-PNG conversion
-> technical.reader extraction
-> process extraction or engineering analysis
-> results.json and report figures

Structured Outputs

Image extraction can produce structured outputs such as:

PID_EXTRACTION
VENDOR_DATASHEET_EXTRACTION
PERFORMANCE_MAP_EXTRACTION
PHASE_ENVELOPE_EXTRACTION
MECHANICAL_ARRANGEMENT_EXTRACTION
route_segments for PipingRouteBuilder
figure_captions and figure_discussion entries for reports
NeqSim JSON for ProcessSystem.fromJson() or ProcessSystem.fromJsonAndRun()

Important Practices

Convert PDFs to PNG before analyzing drawings, charts, and visual tables.
Use OCR only when text extraction is empty or unreliable.
Use 400 DPI and sparse OCR mode for P&IDs.
Cross-check visual extraction against text and tables from the same document.
Preserve original values and units alongside normalized values.
Flag uncertain values rather than guessing.
For task workflows, keep source PDFs in step1_scope_and_research/references/ and converted PNGs in figures/.
For report-quality work, every important figure should have a caption, discussion, linked result, and recommendation.

Current Limitations

The workflow reads exported images and PDFs; it is not a native parser for Echo 3D, CAD, or full 3D model databases.
OCR quality depends strongly on scan resolution and drawing clarity.
Performance-map digitization is semi-structured: axes and points can be extracted, but critical values should be checked against vendor tables when available.
P&ID topology extraction should be validated when it affects simulation wiring or safety conclusions.