1 code implementation • 23 Mar 2024 • HAZ Sameen Shahgir, Khondker Salman Sayeed, Abhik Bhattacharjee, Wasi Uddin Ahmad, Yue Dong, Rifat Shahriyar
GPT4V, the best-performing VLM, achieves 62. 99% accuracy (4-shot) on the comprehension task and 49. 7% on the localization task (4-shot and Chain-of-Thought).
Ranked #1 on Object Localization on IllusionVQA
no code implementations • 22 Jan 2024 • HAZ Sameen Shahgir, Khondker Salman Sayeed, Md Toki Tahmid, Tanjeem Azwad Zaman, Md. Zarif Ul Alam
Recent advances in Deep Learning and Computer Vision have been successfully leveraged to serve marginalized communities in various contexts.
no code implementations • 21 Oct 2023 • H. A. Z. Sameen Shahgir, Khondker Salman Sayeed, Tanjeem Azwad Zaman, Md. Asif Haider, Sheikh Saifur Rahman Jony, M. Sohel Rahman
This report outlines our approach in the IEEE SPS VIP Cup 2023: Ophthalmic Biomarker Detection competition.
2 code implementations • 19 Mar 2023 • H. A. Z. Sameen Shahgir, Khondker Salman Sayeed
This paper presents a method for detecting grammatical errors in Bangla using a Text-to-Text Transfer Transformer (T5) Language Model, using the small variant of BanglaT5, fine-tuned on a corpus of 9385 sentences where errors were bracketed by the dedicated demarcation symbol.
no code implementations • 11 Sep 2022 • H. A. Z. Sameen Shahgir, Khondker Salman Sayeed, Tanjeem Azwad Zaman
After training for 71 epochs, on a training set consisting of 36919 mp3 files, we achieved a training loss of 0. 3172 and WER of 0. 2524 on a validation set of size 7, 747.