arXiv

Pinpoint: Grounded Worldwide Image Geolocation via Cross-Source Retrieval and Reranking

Title: Pinpoint: Achieving Global Image Geolocation Through Cross-Source Retrieval and Reranking

Abstract:

Determining the geographic origin of a photograph based on its visual elements is the core objective of image geolocation. However, scaling this process globally is difficult due to the ambiguity, diversity, and uneven distribution of visual cues. Historically, research has separated the geolocation of standard internet photographs from that of street-view images, overlooking their synergistic potential. Internet photos align more closely with the visual characteristics of user-generated queries, whereas street-view data offers denser, geographically anchored coverage.

To address this, we introduce Pinpoint, a retrieve-and-rerank framework that integrates both data sources within a coarse-to-fine workflow. The system utilizes a contrastive image-GPS embedder, trained on a combination of user-uploaded Flickr images and street-view footage, to establish a unified embedding space for image and GPS data. This space facilitates the initial retrieval of potential locations. Subsequently, an attention-based reranker refines these results by merging visual and GPS features at the candidate level with cross-source contextual evidence from surrounding areas to enhance accuracy.

Distinct from recent approaches, Pinpoint avoids the use of multimodal large-language models, thereby offering faster inference speeds and greater reproducibility. The model sets new state-of-the-art performance standards across all evaluation metrics on established benchmarks, including IM2GPS3k and YFCC4k for internet photos, as well as OSV-5M for street-view imagery.


Source: arXiv Generated at: 2026-06-04 00:00:00 UTC

Related Articles

Exelon CEO Sees Daily Cybersecurity Threats
Bloomberg

Exelon CEO Sees Daily Cybersecurity Threats

Exelon’s CEO warns of daily cybersecurity threats, highlighting persistent risks to the energy giant.

TechCrunch

Ramp raises $750M at $44B valuation as investors hunger for fintechs with an AI story

Ramp secured $750M at a $44B valuation, driven by AI integration and $1.5B+ revenue. The fintech firm now serves 70,000 ...

TechCrunch

Is Silicon Valley ready to put robots in people’s homes? Hello Robot is.

Hello Robot’s Stretch avoids Silicon Valley hype, focusing on practical home deployment to gather essential real-world d...

Canada to Provide Funding, Buy Equity Stakes in AI Startups
Bloomberg

Canada to Provide Funding, Buy Equity Stakes in AI Startups

Canada will fund and buy equity stakes in AI startups to boost the sector. This investment aims to strengthen the nation...

TechCrunch

Chinese spies are using LinkedIn to lure Westerners into sharing sensitive information

A joint Western security alert warns that Chinese spies use LinkedIn to impersonate recruiters and extract sensitive dat...

Peter Thiel’s Family Office Pays Record Rent for Top Miami Tower
Bloomberg

Peter Thiel’s Family Office Pays Record Rent for Top Miami Tower

Peter Thiel’s family office set a record rent for a Miami tower lease. This deal establishes a new benchmark for the cit...