OpenAI Integration

Give OpenAI
the Drawing.
It Counts Everything.

OpenAI's multimodal models read 2D construction drawings, floor plans, electrical schematics, and scanned PDFs. They count structural elements, extract dimensions, identify materials, and return structured data your workflow can act on immediately.

OpenAI Models in RenderDraw

Three Models.
Match the Task to the Model.

OpenAI multimodal

Vision + Strong Reasoning

The primary model for takeoff automation. Reads images of construction drawings and returns structured extraction. Also strong for complex text tasks. Use when you need both vision and reasoning in the same block.

  • Multimodal: image + text input
  • 128k token context window
  • Strong structured output
Use for: drawings, scanned PDFs, and mixed image/text inputs
OpenAI fast

Fast Extraction & Formatting

Economical and fast. Good for structured data extraction from clean text inputs, short classification tasks, and formatting steps where a larger multimodal or reasoning model is not required. Use in high-volume workflows.

  • Low-latency response profile
  • Good for clean text and schema formatting
  • Cost-controlled for high-volume runs
Use for: classification, normalization, and field cleanup
OpenAI reasoning

Advanced Multi-Step Reasoning

OpenAI reasoning models use extended internal computation to solve hard, multi-step problems. Use for complex estimation calculations, constraint satisfaction problems, or cases where standard models give inconsistent answers.

  • Higher latency (extended thinking)
  • Best for hard reasoning tasks
  • Higher cost — use selectively
Use for: planning, reconciliation, and hard tradeoff analysis
Vision Configuration

Configure OpenAI
to Read Your Drawings.

When you enable vision input on an OpenAI block in RenderDraw, you configure how the drawing is delivered to the model. Three settings determine extraction quality:

  • Resolutionhigh for detailed drawings with fine text and small symbols; low for preliminary overview extraction (faster, cheaper)
  • Image source — the workflow's input document (PDF rendered to image), a CAD-exported PNG, or a specific page range from a multi-page document
  • Pre-processing — optional contrast enhancement, scale bar detection, and north-arrow identification to improve extraction accuracy
Full Vision Config Reference
OpenAI Vision Block Config
{
  "provider": "openai",
  "model": "selected-openai-vision-model",
  "vision": {
    "enabled": true,
    "resolution": "high",
    "source": "workflow.input.pdf",
    "pages": "all",
    "preprocessing": {
      "contrastEnhance": true,
      "scaleBarDetect": true
    }
  },
  "system": "You are a takeoff specialist.
  Analyze this construction drawing.
  For each structural element, return:
  type, count, unit, dimensions.",
  "temperature": 0.0,
  "maxTokens": 4096,
  "outputSchema": {
    "items": [{
      "type": "string",
      "count": "number",
      "unit": "string",
      "dimensions": "string"
    }]
  }
}
Primary Use Case

OpenAI Vision for
Construction Takeoffs.

Takeoff automation is a high-value use of OpenAI vision in construction workflows. A drawing package that takes an estimator days can be converted into structured review data in minutes.

Input: 2D Drawing PDF

Submitted by the client via email, Procore, or Salesforce attachment

Trigger

OpenAI Vision Analysis

Reads each page at high resolution. Extracts dimensions, element counts, material symbols, and annotation text

OpenAI

Write to Takeoff Workbook

Each extracted item becomes a workbook row: item name, quantity, unit, dimensions, confidence score

Workbook

Salesforce CPQ Pricing

Each workbook row is priced against the Salesforce product catalog

Datasource

Estimator Review Gate

Human reviews workbook. Approves or adjusts before quote generation

Gate
Best Practices

Get the Best Results
from OpenAI Vision.

Use High Resolution for Detail

Always use resolution: "high" for construction drawings. Low resolution misses small annotation text and fine dimension lines. The cost difference per drawing page is minimal compared to the accuracy gain.

Set Temperature to 0

For extraction tasks, set temperature to 0.0. You want deterministic, consistent output — not creative variation. Temperature above 0 introduces randomness into quantity counts, which is never desirable for takeoffs.

Define an Output Schema

Always provide an output JSON schema for extraction blocks. This forces the model to return consistent field names and types. Without a schema, field names vary between runs and downstream blocks fail to map correctly.

Include a Confidence Field

Ask the model to include a confidence score (0.0–1.0) for each extracted item. Route low-confidence items to a human review queue. Items above 0.9 confidence are typically very accurate; below 0.7 always review.

Process Page by Page

For multi-page documents, configure the vision block to process one page at a time and aggregate results. Sending all pages at once reduces per-page attention quality and can lead to missed elements on crowded drawings.

Follow with a Reasoning Block

Use OpenAI for the initial vision extraction, then pass its structured output to the best long-context or reasoning model for cross-referencing, compliance checking, or proposal drafting. The blocks are complementary, not competing.

Let OpenAI Read
Your Next Drawing.

Add an OpenAI vision block to your takeoff workflow. Configure resolution, define your output schema, and run it against a real drawing.