Abstract
The assessment of solar photovoltaic (PV) potential on urban building façades is pivotal for sustainable urban planning, yet it is often constrained by manual, time-intensive methods. This study introduces a novel end-to-end framework, the Semantic Façade Solar-PV Assessment (SF-SPA), that pioneers a paradigm shift by leveraging generalist foundation models for this task. Our pipeline integrates vision foundation models (VFMs) and large language models (LLMs) for rapid, automated, and accurate façade-based PV assessment from single 2D street-view images. The framework was validated on a diverse dataset of 80 buildings from four cities across different climates and architectural styles. Results show high accuracy, with an average area estimation error of 6.2% against expert-defined ground truth, and exceptional efficiency at approximately 100 seconds per building. This work demonstrates a scalable and data-efficient alternative to traditional methods that rely on 3D data or specially trained models, paving the way for large-scale urban energy analysis.
Keywords: Solar PV potential, Large language model, Semantic segmentation, Urban building façades
Copyright ©
Energy Proceedings