[go: nahoru, domu]

Skip to content
View lucky-star-king's full-sized avatar

Highlights

  • Pro

Block or report lucky-star-king

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

(ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning

Python 14 Updated Sep 5, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 16,608 1,699 Updated Sep 9, 2024

The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.

Python 64 4 Updated Sep 3, 2024

The code used to train and run inference with the ColPali architecture.

Python 451 46 Updated Sep 9, 2024

This is the official repository for Retrieval Augmented Visual Question Answering

Python 157 14 Updated Sep 3, 2024

pytablewriter is a Python library to write a table in various formats: AsciiDoc / CSV / Elasticsearch / HTML / JavaScript / JSON / LaTeX / LDJSON / LTSV / Markdown / MediaWiki / NumPy / Excel / Pan…

Python 605 43 Updated Feb 11, 2024

Generative Representational Instruction Tuning

Jupyter Notebook 520 39 Updated Sep 3, 2024

Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representations.

Jupyter Notebook 18 1 Updated Nov 8, 2023

A lightweight open-source package to fine-tune embedding models.

Python 15 Updated Feb 4, 2024

【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment

Python 675 50 Updated Mar 25, 2024

本人的科研经验

5,437 329 Updated Aug 29, 2024

Mixture-of-Experts for Large Vision-Language Models

Python 1,898 121 Updated May 15, 2024

Implementation of PALI3 from the paper PALI-3 VISION LANGUAGE MODELS: SMALLER, FASTER, STRONGER"

Python 136 2 Updated Sep 9, 2024

Zero-shot Document Ranking with Large Language Models.

Python 84 7 Updated Jul 4, 2024

Official repository of ICCV 2021 - Image Retrieval on Real-life Images with Pre-trained Vision-and-Language Models

97 3 Updated Jul 31, 2024

RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.

Python 295 38 Updated Sep 7, 2024

official repository for ListT5

Python 32 Updated Aug 29, 2024

(AAAI 2024) BLIVA: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions

Python 257 26 Updated Apr 14, 2024

Conversational Recommender System (CRS) paper list. 对话推荐系统论文列表

123 25 Updated Nov 24, 2022

E5-V: Universal Embeddings with Multimodal Large Language Models

Python 142 6 Updated Jul 17, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,176 2,100 Updated Aug 12, 2024

Collection of Composed Image Retrieval (CIR) papers.

65 3 Updated Jul 26, 2024

[ICLR 2023] This is the code repo for our ICLR‘23 paper "Universal Vision-Language Dense Retrieval: Learning A Unified Representation Space for Multi-Modal Retrieval".

Python 41 6 Updated Jul 3, 2024

[ACL 2024] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Module Plugin".

Python 22 3 Updated Jun 30, 2024

Counterfactual Regression

Python 291 82 Updated Dec 7, 2022

History-Aware Conversational Dense Retrieval. A codebase for ACL 2024 Findings accepted paper.

Python 10 Updated Jun 2, 2024
Python 8 Updated Feb 22, 2024

This is the repository for the GenIR survey.

100 6 Updated Aug 2, 2024

a simple vae and cvae from keras

Python 1,245 377 Updated May 18, 2021
Next