
Niloy R
Fast and Efficient



Work experience
Research Assistant
ResearchCollab • Part-time
Sep 2025 - Present • 9 mos
○ Developed a multimodal AI evaluation framework for Video Question Answering models using PyTorch, Hugging Face Transformers, OpenCV, CUDA, and Python.
○ Built reusable model-adapter pipelines for Qwen-VL, LLaVA, SmolVLM, InternVL, X-CLIP, and related vision language models, standardizing inference, confidence scoring, metadata logging, and result comparison.
○ Implemented attention visualization workflows to analyze model reasoning across video frames using patch-level heatmaps, frame overlays, and spatio-temporal attention summaries.
○ Designed model evaluation metrics including spatial IoU, Dice/F1, AUPRC, AUROC, temporal IoU, top-k frame recall, and deletion/insertion-based faithfulness testing.
○ Automated batch experiments, generated JSON/CSV evaluation reports, and prepared reproducible outputs for research review, technical documentation, and publication support.
3 Reviews

khgvlhb

Egypt

croatiasman

Sweden

fastneasytax

Canada
Worked with me to ensure that the solution works. Implemented v2 CAPTCHA without asking for another order when v3 CAPTCHA didn't stop spam.

