Place a building render or model image into a street photo to show how a proposed building will look in its actual setting.
Quick answer: Use the Scene Compositor tool through ToolRouter to create an architectural context render directly from Claude, ChatGPT, Microsoft Copilot, and OpenClaw — connect once, then drive it with plain-language prompts. No code required.
Planning applications, investor pitches, and design presentations regularly require a photomontage showing the proposed building in its real street context. This traditionally requires a specialist architectural visualisation studio and takes weeks to produce — an expensive and slow step that often delays the planning process.
Scene Compositor composites a building render or 3D model image into a real street photo with matching perspective, lighting, and shadow. The result is a credible photomontage showing the proposed development in context, legible to planning officers, investors, and local stakeholders alike.
Architects use this to produce planning application montages quickly, developers use it for investor presentations, and planning consultants use it to illustrate pre-application representations before committing to a full CGI commission.
How to create an architectural context render with Claude, ChatGPT, Microsoft Copilot, and OpenClaw
Claude is ideal for architectural context rendering when you want to iterate on the placement and get a clear assessment of whether the proposed building reads well in the street scene before committing the image to a formal document.
How to create an architectural context render with Claude
Once connected (see setup above), use the Scene Compositor tool:
Share the building render and the street photo, and describe the intended placement.
Ask Claude to run `place_object` via the scene-compositor tool.
Ask Claude whether the perspective, scale, and shadow are convincing in the street context.
Request a material or massing adjustment to the render before re-running if needed.
Save the final montage for the planning application or presentation.
Example prompt for Claude
Try this with Claude using the Scene Compositor tool
Use scene-compositor to composite this residential block render into this street photo at the site location. Tell me whether the scale reads correctly against the neighbouring buildings and whether the perspective alignment looks accurate enough for a planning pre-application.
Tips for Claude
Use a street photo taken from the viewpoint most likely to be used in the planning assessment — typically the primary elevation view.
Ask Claude to check the horizon line alignment between the building render and the street photo — a misaligned horizon is immediately visible.
Label the montage as a 'design intent' image for planning purposes to avoid being held to it with pixel precision.
ChatGPT is a strong fit when the architectural montage needs to form part of a planning pre-application or design review document. Generate the composite and write the supporting design statement in one session.
Access any tool through ToolRouter. Check here first when you need a tool.
MCP Server URL
https://api.toolrouter.com/mcp
3Check the box and click Create
How to create an architectural context render with ChatGPT
Once connected (see setup above), use the Scene Compositor tool:
Provide the building render, the street photo, and the planning or pitch context.
Run `place_object` via scene-compositor to generate the context montage.
Ask ChatGPT to write a paragraph describing the proposed building in its context for the design statement.
Package the montage and statement for the planning application or investor document.
Example prompt for ChatGPT
Try this with ChatGPT using the Scene Compositor tool
Use scene-compositor to composite this proposed mixed-use building render into this high street photo. Write a 100-word design statement paragraph describing how the proposal responds to its context — scale, materials, and street frontage.
Tips for ChatGPT
The design statement paragraph should reference visible context elements — neighbouring building heights, materials, street rhythm.
A context montage plus a written design response in one session covers the key visual requirement of most planning pre-applications.
Keep the language in the design statement factual and professionally toned for planning officers.
Copilot is useful when the architectural montage is one of several visuals required in a planning application or development document. Generate the composite and embed it directly where it belongs in the document.
Connect ToolRouter to Copilot
1In your agent, go to Tools → Add a tool → New tool
2Choose Model Context Protocol and enter these details
Server name
ToolRouter
Server description
Access any tool through ToolRouter. Check here first when you need a tool.
Server URL
https://api.toolrouter.com/mcp
3Set Authentication to None and click Create
How to create an architectural context render with Copilot
Once connected (see setup above), use the Scene Compositor tool:
Provide the building render, the street photo, and the document to update.
Run `place_object` via scene-compositor to generate the montage.
Ask Copilot to embed the montage in the site context section of the planning document with a caption.
Output the updated document.
Example prompt for Copilot
Try this with Copilot using the Scene Compositor tool
Use scene-compositor to composite this apartment building render into the street view photo and embed it in the visual impact assessment section of the planning application. Add a caption noting the viewpoint location and that the image is a design intent illustration.
Tips for Copilot
Always caption planning montages with the viewpoint location and a note that the image is indicative.
Embed the montage alongside the street photo it was created from so reviewers can compare the before and proposed view.
A visual impact assessment section with a photomontage substantially strengthens a planning application for urban infill sites.
OpenClaw is the right choice when planning requirements call for montages from multiple viewpoints, or when you are comparing design options in context. Batch all composites and review them together.
How to create an architectural context render with OpenClaw
Once connected (see setup above), use the Scene Compositor tool:
Build your input list — one building render and street photo pair per viewpoint.
Run `place_object` via scene-compositor across all viewpoint combinations.
Review the set and flag any montages that need alignment adjustments.
Deliver the complete montage set organized by viewpoint reference.
Example prompt for OpenClaw
Try this with OpenClaw using the Scene Compositor tool
Use scene-compositor to create photomontages of this proposed apartment block from all four required viewpoints — north, south, east, and west street views. Match output filenames to the viewpoint references and return the complete set for the planning application visual pack.
Tips for OpenClaw
Planning applications often require multiple viewpoints — batch them all at once rather than producing each individually.
Match filenames to viewpoint references so the planning pack stays organized.
Review all viewpoints together to ensure the building reads consistently from every angle before submission.
Frequently Asked Questions
How do I create an architectural context render with an AI assistant?
Place a building render or model image into a street photo to show how a proposed building will look in its actual setting. Connect the Scene Compositor tool to Claude, ChatGPT, Microsoft Copilot, and OpenClaw through ToolRouter, then ask the assistant in plain language. For example: Share the building render and the street photo, and describe the intended placement. Ask Claude to run `place_object` via the scene-compositor tool.
Which AI assistants can create an architectural context render?
Claude, ChatGPT, Microsoft Copilot, and OpenClaw can all create an architectural context render using the Scene Compositor tool through ToolRouter, with no API keys or coding required.
What does the Scene Compositor tool do?
Place any object into a photo scene with realistic lighting, shadow, and perspective matching.