I was working on a project that needed both semantic search and content moderation, so I built an API that handles both.
The problem it solves: inference usually requires expensive GPU instances, and the infrastructure around it is hard to scale. Most teams give up quickly once they realize how much infrastructure this takes.
What it does: semantic search + content moderation. You can search images by describing them ("girl with guitar") or find text by meaning ("movie about billionaire in flying suit" → Iron Man). It also does NSFW detection with specific labels.
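To make the search part concrete, here is a minimal sketch of the ranking step, assuming the query and the stored items have already been turned into embedding vectors by a CLIP-style model. The names `cosine_similarity` and `top_k` are made up for illustration, and the brute-force scan is only a stand-in for what a vector DB would do with an index; this is not the actual code behind the API.

```rust
/// Cosine similarity between two equal-length vectors.
/// Returns 0.0 if either vector has zero norm.
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    assert_eq!(a.len(), b.len(), "embeddings must share a dimension");
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        0.0
    } else {
        dot / (norm_a * norm_b)
    }
}

/// Score every stored item against the query embedding and
/// return the `k` best (id, score) pairs, highest score first.
/// `items` is a hypothetical in-memory store of (id, embedding) pairs.
fn top_k<'a>(
    query: &[f32],
    items: &'a [(&'a str, Vec<f32>)],
    k: usize,
) -> Vec<(&'a str, f32)> {
    let mut scored: Vec<(&str, f32)> = items
        .iter()
        .map(|(id, emb)| (*id, cosine_similarity(query, emb)))
        .collect();
    // Sort descending by similarity; total_cmp gives a total order on f32.
    scored.sort_by(|a, b| b.1.total_cmp(&a.1));
    scored.truncate(k);
    scored
}
```

Because text and images land in the same embedding space with CLIP-style models, the same ranking function can serve both "describe an image" and "find text by meaning" queries; only the encoder that produces `query` differs.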
Stack:
- Rust Candle for ML models (CLIP)
- Rust Axum + Tokio for the API
- Vector DB for search
I am considering switching to a more lightweight CLIP-based model, such as MobileCLIP or a quantized CLIP. What do you guys think?
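For anyone weighing the quantized option, here is a rough sketch of what symmetric int8 quantization does to a block of weights, assuming a single per-tensor scale. `QuantizedTensor`, `quantize`, and `dequantize` are hypothetical names for illustration; this is not how Candle or any particular quantized CLIP checkpoint implements it, just the basic trade-off of 4x less memory against a small rounding error.

```rust
/// Weights stored as i8 plus one f32 scale for the whole tensor.
struct QuantizedTensor {
    values: Vec<i8>,
    scale: f32,
}

/// Symmetric per-tensor quantization: map [-max_abs, max_abs] onto [-127, 127].
fn quantize(weights: &[f32]) -> QuantizedTensor {
    let max_abs = weights.iter().fold(0.0_f32, |m, w| m.max(w.abs()));
    // Avoid dividing by zero when every weight is 0.
    let scale = if max_abs == 0.0 { 1.0 } else { max_abs / 127.0 };
    let values = weights
        .iter()
        .map(|w| (w / scale).round().clamp(-127.0, 127.0) as i8)
        .collect();
    QuantizedTensor { values, scale }
}

/// Recover approximate f32 weights; each value is off by at most about scale / 2.
fn dequantize(q: &QuantizedTensor) -> Vec<f32> {
    q.values.iter().map(|&v| v as f32 * q.scale).collect()
}
```

The rounding error per weight is bounded by roughly half the scale, so tensors with a few large outliers lose the most precision. Whether that matters for search quality is something to measure on real queries rather than assume.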