All creators
AI WITH Rithesh

AI WITH Rithesh

Open-Source OCR, Audio Models, and Developer Tools with a focus on live implementation and stress-testing.

Rating
7.6
ReReview score
Award
Worth Prioritizing
Chart
#44
AI & Software Tools
Subscribers
51K
YouTube
Age
5y 9m
Channel age

Nutrition Label

Rithesh delivers practical, code-first evaluations of emerging AI models, prioritizing live demonstrations in Google Colab over marketing slides. He rigorously stress-tests tools against difficult edge cases—such as handwriting or complex audio—often revealing failure modes that official benchmarks miss. While his technical testing is grounded and authentic, his video titles sometimes oversell a model's capabilities compared to his actual, more nuanced findings.

Strengths

  • +Live Code Testing
  • +Edge Case Validation
  • +Resource Usage Analysis

Notes

  • !Titles sometimes promise revolutionary performance that the video's actual testing proves to be merely incremental.
  • !Watch for specific hardware metrics like GPU RAM usage, which he consistently verifies during live runs.

Rating Breakdown

Experience Authenticity
8.3
Rigor & Evidence
7.3
Original Analysis
6.4
Technical Depth
6.7
Disclosure Clarity
7.1
Title-Content Alignment
8.2
Expertise Signal
7.2
Communication Effectiveness
7.0

Breakdown across the key dimensions we rate. Methodology →

Recent Videos

Qwen 3.5 Just Dropped  And It Claims to Outperform GPT-5.2, Gemini & Claude at 60% the Cost!
Pending

Qwen 3.5 Just Dropped And It Claims to Outperform GPT-5.2, Gemini & Claude at 60% the Cost!

Feb 18, 2026 • 170 views
SaarasV3 Next Generation Indic Languages Speech Recognition model beats Gemini 3 Pro
Pending

SaarasV3 Next Generation Indic Languages Speech Recognition model beats Gemini 3 Pro

Feb 12, 2026 • 119 views
Sarvam Vision SOTA OCR for 22 Indian Languages + English beats Frontier Models
Scored

Sarvam Vision SOTA OCR for 22 Indian Languages + English beats Frontier Models

Feb 6, 2026 • 1.1K views
moltbook : Social Network for AI Agents ABSOLUTE CHAOS
Scored

moltbook : Social Network for AI Agents ABSOLUTE CHAOS

Jan 31, 2026 • 840 views
PaddleOCR-VL-1.5 New SOTA OCR Underwhelming?
Scored

PaddleOCR-VL-1.5 New SOTA OCR Underwhelming?

Jan 30, 2026 • 472 views
DeepSeek OCR 2 — A Tiny 3B Model Beating the Best 🤯
Scored

DeepSeek OCR 2 — A Tiny 3B Model Beating the Best 🤯

Jan 27, 2026 • 1.5K views
Microsoft’s New ASR Transcribes 60-Minute Audio in One Shot—with Speakers & Timestamps Open Source
Scored

Microsoft’s New ASR Transcribes 60-Minute Audio in One Shot—with Speakers & Timestamps Open Source

Jan 24, 2026 • 181 views
Qwen3-TTS Can Clone Any Voice and It’s Scarily Good Open Source Too
Scored

Qwen3-TTS Can Clone Any Voice and It’s Scarily Good Open Source Too

Jan 23, 2026 • 419 views
Pocket TTS  CPU Only Lightweight TTS Voice Cloning
Scored

Pocket TTS CPU Only Lightweight TTS Voice Cloning

Jan 19, 2026 • 347 views
Inside the 2025 AI Shock: The Chinese Labs Outpacing the West
Scored

Inside the 2025 AI Shock: The Chinese Labs Outpacing the West

Dec 27, 2025 • 341 views
Microsoft VibeVoice-Realtime: Lightning-Fast TTS for Live Streams & Instant Speech from Any Model!
Scored

Microsoft VibeVoice-Realtime: Lightning-Fast TTS for Live Streams & Instant Speech from Any Model!

Dec 8, 2025 • 368 views
HunyuanOCR  Best Free OCR from China blows away the competition  Extensive Testing Colab Demo
Scored

HunyuanOCR Best Free OCR from China blows away the competition Extensive Testing Colab Demo

Nov 29, 2025 • 685 views
Google Nano Banana Pro 🍌🍌 : Ultimate AI for image generation + editing
Pending

Google Nano Banana Pro 🍌🍌 : Ultimate AI for image generation + editing

Nov 21, 2025 • 116 views
Gemini 3  New Era of Intelligence Begins — First Tests, Shock Results, and FREE Access
Pending

Gemini 3 New Era of Intelligence Begins — First Tests, Shock Results, and FREE Access

Nov 19, 2025 • 259 views
Kimi K2 Thinking vs Qwen 3 Max Thinking Battle of the Heavyweight Reasoning models
Pending

Kimi K2 Thinking vs Qwen 3 Max Thinking Battle of the Heavyweight Reasoning models

Nov 7, 2025 • 527 views

Why this rating

Evidence receipts showing why each dimension is rated the way it is.

Experience Authenticity10/10
Let's start with our usual set of images... I want to do this simple tabular data.
[3:42]

The creator demonstrates direct engagement by uploading his own diverse dataset to the tool's playground for live testing.

Problem Encounter10/10
Here is a small mistake over here. It is actually some 22 percentage... so it says 2 over here.
[6:37]

The analyst identifies a specific data extraction error in a chart, proving he is verifying the output rather than just accepting it.

Expertise Signal8/10
Some other OCRs actually fail on this document altogether. For example, Chandra OCR I've seen... they fail because this language is Kannada.
[5:38]

He contextualizes the performance by comparing it to specific competitors and their known limitations with Indian languages.

Title-Content Alignment5/10
I find this OCR on par with other OCRs like OLMOCR 2... even though they claim a really high percentage.
[8:17]

The title claims the model 'blows away the competition,' but the video conclusion is much more measured, stating it is merely 'on par' with existing tools.

Categories
Automation & AgentsAudio & VoiceData & AnalyticsDeveloper PlatformsResearch Tools
Formats
ReviewsTutorials