A C++ implementation of MobileClip image classification using TensorRT for high-performance inference.

mobileclip_cpp_trt/
├── CMakeLists.txt    # CMake build configuration
├── README.md         # This file
├── ...
Train vision-language models (VLMs) with reinforcement learning using Group Relative Policy Optimization (GRPO) on multimodal image+text tasks via the verl framework. This workflow trains ...
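To make the "group relative" part of GRPO concrete, here is a minimal sketch of how rewards for several sampled responses to the same prompt can be turned into advantages by normalizing within the group. It is an illustration only, not the verl implementation: the function name `group_relative_advantages`, the `eps` value, and the use of the population standard deviation are all assumptions made for this example.

```python
from statistics import fmean, pstdev


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one group of responses to the same prompt.

    Hypothetical helper for illustration; not part of verl's API.
    """
    mean = fmean(rewards)
    # Assumption: population standard deviation. Real implementations
    # may use the sample standard deviation instead.
    std = pstdev(rewards)
    # eps keeps the division finite when every reward in the group is equal.
    return [(r - mean) / (std + eps) for r in rewards]


# Example: four sampled responses to one image+text prompt,
# scored 1.0 when the answer is correct and 0.0 otherwise.
rewards = [1.0, 0.0, 0.0, 1.0]
advantages = group_relative_advantages(rewards)
```

Responses that score above the group mean receive a positive advantage and those below receive a negative one, so no separate value model is needed to provide a baseline. When every response in a group gets the same reward, all advantages are zero and that group contributes no learning signal.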