Muhammad Maaz mmaaz60

An Electrical Engineer with experience in Computer Vision software development. Skilled in Machine Learning, Deep Learning and Computer Vision.

134 followers · 4 following

mmaaz60/README.md

Hi there 👋

🔭 I’m currently working on multi-modal transformers and multi-task learning
🌱 I’m currently learning to play Table Tennis 🏓
📫 How to reach me: muhammad.maaz@mbzuai.ac.ae

Pinned

mbzuai-oryx/Video-ChatGPT Public

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted fo…

Python · 1.2k stars · 107 forks

mbzuai-oryx/groundingLMM Public

[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

Python · 772 stars · 38 forks

mbzuai-oryx/VideoGPT-plus Public

Official repository of the paper "VideoGPT+: Integrating Image and Video Encoders for Enhanced Video Understanding".

Python · 213 stars · 14 forks

mbzuai-oryx/LLaVA-pp Public

🔥🔥 LLaVA++: Extending LLaVA with Phi-3 and LLaMA-3 (LLaVA LLaMA-3, LLaVA Phi-3)

Python · 806 stars · 59 forks

mbzuai-oryx/PALO Public

(WACV 2025) Vision-language conversation in 10 languages including English, Chinese, French, Spanish, Russian, Japanese, Arabic, Hindi, Bengali and Urdu.

Python · 81 stars · 5 forks

EdgeNeXt Public

[CADL'22, ECCVW] Official repository of the paper "EdgeNeXt: Efficiently Amalgamated CNN-Transformer Architecture for Mobile Vision Applications".

Python · 345 stars · 39 forks