Search Results for author: Lan Mu

Found 3 papers, 1 paper with code

Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation

no code implementations • 28 Mar 2024 • Zhongliang Zhou, Jielu Zhang, Zihan Guan, Mengxuan Hu, Ni Lao, Lan Mu, Sheng Li, Gengchen Mai

Geolocating precise locations from images presents a challenging problem in computer vision and information retrieval. Traditional methods typically employ either classification, which divides the Earth's surface into grid cells and classifies images accordingly, or retrieval, which identifies locations by matching images with a database of image-location pairs.

Retrieval • Text Generation

On the Promises and Challenges of Multimodal Foundation Models for Geographical, Environmental, Agricultural, and Urban Planning Applications

no code implementations • 23 Dec 2023 • Chenjiao Tan, Qian Cao, Yiwei Li, Jielu Zhang, Xiao Yang, Huaqin Zhao, Zihao Wu, Zhengliang Liu, Hao Yang, Nemin Wu, Tao Tang, Xinyue Ye, Lilong Chai, Ninghao Liu, Changying Li, Lan Mu, Tianming Liu, Gengchen Mai

The advent of large language models (LLMs) has heightened interest in their potential for multimodal applications that integrate language and vision.

Image Classification • Land Cover Classification • +5

Text2Seg: Remote Sensing Image Semantic Segmentation via Text-Guided Visual Foundation Models

1 code implementation • 20 Apr 2023 • Jielu Zhang, Zhongliang Zhou, Gengchen Mai, Lan Mu, Mengxuan Hu, Sheng Li

We developed a pipeline that leverages multiple foundation models (FMs) to facilitate remote sensing image semantic segmentation tasks guided by a text prompt, which we denote as Text2Seg.

Instance Segmentation • Segmentation Of Remote Sensing Imagery • +2
