Product

Vector Retrieval and GPU Acceleration

Exploration of the development of vector retrieval technology, transitioning from traditional CPU-based indexing methods to modern GPU-accelerated solutions. Highlights various classic algorithms in Approximate Nearest Neighbor Search (ANNS) and their applications in large-scale data processing, particularly focusing on their crucial role in fast semantic search and real-time decision support in the medical field.

vector retrieval GPU acceleration ANNS

Non-pathological image filtering in pathological multimodal data sets

Existing open-source pathology text-image paired datasets (e.g., Quilt-1m) are constructed by extracting frames from YouTube videos, though initial filtering strategies have been applied, significant noise (e.g., non-pathological images) remains. Training classifiers on datasets of varying scales and architectures demonstrates substantial performance disparities among different models. Experimental results further indicate that fine-tuning large models using an optimized dataset (filtered to exclude non-pathological data) significantly enhances their performance in downstream tasks.

Data Cleaning Data Filtering Domain-specific

A Training Free Algorithm for Patch Level Quality Control in Pathological Images

The production process of pathological digital slides involves multiple critical steps, and potential quality issues in any of these steps may lead to defects such as image defocusing and tissue overlap. These abnormal regions result in the loss of pathological structural information, significantly compromising the accuracy and reliability of clinical diagnoses. Therefore, there is an urgent need to develop a rapid and efficient algorithmic framework to precisely identify and filter problematic regions, while further investigating the interference mechanisms and quantifying the impact of such low-quality image patches on the training of intelligent pathological analysis models.

Quality Control Data Filtering Training-Free

Academics | The Hong Kong University of Science and Technology

Vector Retrieval and GPU Acceleration

Non-pathological image filtering in pathological multimodal data sets

A Training Free Algorithm for Patch Level Quality Control in Pathological Images

Non-pathological Image Filtering

MLLM Evaluation in Breast Cancer

Scaling law for pathology

CLAM-based Image-Caption Generation

KB-enhanced Pathology CLIP (public datasets)

Pathology Image-Caption Evaluation

Image-Caption Data Market Demo

GraphRAG Based Pathology LLM

Pathology Image-Text Structured Alignment Based on Multiple Instance Learning

Pivot：Enhancing Pathology Image-Text Alignment with a Pathology Knowledge Base

RIVL：Addressing Modality Missing in Pathology Image-Text Alignment Using Interpolation

Keywords

Academics | The Hong Kong University of Science and Technology

香港科技大学(广州)- 安必平

医疗数据智能联合实验中心

Vector Retrieval and GPU Acceleration

Non-pathological image filtering in pathological multimodal data sets

A Training Free Algorithm for Patch Level Quality Control in Pathological Images

Non-pathological Image Filtering

MLLM Evaluation in Breast Cancer

Scaling law for pathology

CLAM-based Image-Caption Generation

KB-enhanced Pathology CLIP (public datasets)

Pathology Image-Caption Evaluation

Image-Caption Data Market Demo

GraphRAG Based Pathology LLM

Pathology Image-Text Structured Alignment Based on Multiple Instance Learning

Pivot：Enhancing Pathology Image-Text Alignment with a Pathology Knowledge Base

RIVL：Addressing Modality Missing in Pathology Image-Text Alignment Using Interpolation

Keywords