This is like GPS for the inside of buildings: you point a camera around a room, and the system figures out exactly where you are on a 2D floor plan by reasoning about the 3D geometry (depth) of the space it sees.
Traditional indoor localization (finding a person's or device's position inside a building) is expensive, requires dedicated beacons or sensors, or is too inaccurate. Architects, facility managers, and property-technology teams struggle to reliably align what a camera sees with existing floor plans for navigation, inspections, and asset tracking. PALMS+ aims to match camera images to floor plans robustly, using AI that understands depth and geometry, without heavy infrastructure.
If the technology matures, the moat would come from robust geometric localization across many building types and lighting conditions, plus any proprietary training datasets of real buildings and their floor plans. The modular design also allows better depth foundation models to be swapped in over time, keeping performance competitive.
Hybrid
Unknown
High (Custom Models/Infra)
Running depth foundation models and geometric matching at scale on edge/mobile devices may be constrained by compute and latency; robustness across diverse real-world building layouts is another challenge.
Early Adopters
Unlike generic AR kits and visual-inertial odometry focused on SLAM and tracking (e.g., ARCore/ARKit), PALMS+ is specifically targeted at aligning monocular images with existing 2D floor plans using depth-aware foundation models in a modular fashion. This makes it more relevant for architecture, construction, real estate, and facilities workflows that already center around floor plan artifacts.
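As a rough illustration only (not PALMS+'s actual pipeline, which is not detailed here), the core idea of aligning camera observations with a 2D floor plan can be sketched as scan matching: a depth profile derived from the camera view is compared against simulated depth profiles at candidate positions on the plan, and the best-matching position wins. All names, the toy grid plan, and the brute-force search below are hypothetical:

```python
import math

# Toy floor plan: 1 = wall, 0 = free space (border walls around a 3x3 room).
PLAN = [
    [1, 1, 1, 1, 1],
    [1, 0, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 0, 0, 0, 1],
    [1, 1, 1, 1, 1],
]

def cast_ray(plan, x, y, angle, max_dist=10.0, step=0.05):
    """Distance from (x, y) along `angle` to the first wall cell."""
    d = 0.0
    while d < max_dist:
        d += step
        cx, cy = int(x + d * math.cos(angle)), int(y + d * math.sin(angle))
        if cx < 0 or cy < 0 or cy >= len(plan) or cx >= len(plan[0]):
            break  # left the map without hitting a wall
        if plan[cy][cx] == 1:
            return d
    return max_dist

def localize(plan, observed, n_rays=16):
    """Score every free cell: simulate a depth scan from its center and
    compare to the observed depths (sum of squared errors)."""
    angles = [2 * math.pi * i / n_rays for i in range(n_rays)]
    best, best_err = None, float("inf")
    for gy in range(len(plan)):
        for gx in range(len(plan[0])):
            if plan[gy][gx] == 1:
                continue
            sim = [cast_ray(plan, gx + 0.5, gy + 0.5, a) for a in angles]
            err = sum((s - o) ** 2 for s, o in zip(sim, observed))
            if err < best_err:
                best, best_err = (gx, gy), err
    return best

# Simulate depths a camera at cell (3, 1) would observe, then recover it.
angles = [2 * math.pi * i / 16 for i in range(16)]
observed = [cast_ray(PLAN, 3.5, 1.5, a) for a in angles]
print(localize(PLAN, observed))  # → (3, 1)
```

A real system would replace the simulated scan with depths predicted by a monocular depth foundation model and use a far more scalable matcher, but the structure, comparing observed geometry against plan-derived geometry, is the same.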