Stars
"Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models?"
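A collection of industry-practice articles on search, recommendation, advertising, user growth, and related topics (sources: Zhihu, Datafuntalk, technical WeChat official accounts)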
The implementation of FINER-MLLM, which was accepted at MM2024.
Tevatron - A flexible toolkit for neural retrieval research and development.
[ACM MM 2024] Improving Composed Image Retrieval via Contrastive Learning with Scaling Positives and Negatives
【ICLR 2024, Spotlight】Sentence-level Prompts Benefit Composed Image Retrieval
LAVIS - A One-stop Library for Language-Vision Intelligence
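Use ChatGPT to summarize arXiv papers. Accelerates the whole research workflow: use ChatGPT for full-paper summarization, professional translation, polishing, peer review, and review responses.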
(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.
The code used to train and run inference with the ColPali architecture.
This is the official repository for Retrieval Augmented Visual Question Answering
pytablewriter is a Python library to write a table in various formats: AsciiDoc / CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pan…
Generative Representational Instruction Tuning
Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representations.
A lightweight open-source package to fine-tune embedding models.
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Mixture-of-Experts for Large Vision-Language Models
Implementation of PALI3 from the paper "PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"
Zero-shot Document Ranking with Large Language Models.
Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
Conversational Recommender System (CRS) paper list.