MonkeyScore - AI Assurance Metrics Analyzer

Measure 15 AI Assurance Metrics from any LLM output. Inline annotations, mode presets, trend charts. 100% local.

As of June 2026, MonkeyScore - AI Assurance Metrics Analyzer has 10 users in the Productivity category.

Users
10 (no change, 0%)
Rating
Not available (no change, 0%)
Reviews
Not available (no change, 0%)
Version
2.2.3
Manifest V3
90-day change · In the last 90 days this extension had 1 version update.

History

9 snapshots

Tracking since Apr 1, 2026.

Chart: users over time, Apr 1, 2026 to Jun 9, 2026 (y-axis from 8.68 to 13.32).
Date          Users  Rating  Reviews  Version
Apr 1, 2026   10     -       -        2.2.1
Apr 16, 2026  9      -       -        2.2.1
Apr 22, 2026  10     -       -        2.2.3
Apr 27, 2026  11     -       -        2.2.3
May 4, 2026   -      -       -        2.2.3
May 21, 2026  13     -       -        2.2.3
May 28, 2026  10     -       -        2.2.3
Jun 4, 2026   11     -       -        2.2.3
Jun 9, 2026   12     -       -        2.2.3
Now           10     -       -        2.2.3

Changelog

  • Apr 16, 2026
    description
    Old:
    sidemind.ai is a Chrome extension that measures 15 AI Assurance Metrics from any LLM output. 100% local. Zero data leaves your browser.
    
    What's new in v2.2: Contextual Awareness
    
    */Platform Auto-Detection
    The extension automatically recognizes which AI platform you're using. Eleven major platforms are supported, and scoring adjusts accordingly.
    
    */Platform-Specific Weighted Scoring
    Not all AI models fail the same way. Each platform now has a tailored scoring profile based on known failure modes: hallucination, groundedness, policy compliance, security, and more are weighted differently depending on where you're analyzing.
    
    */Prompt-Response Alignment
    Provide the original prompt and the engine measures how well the output actually addresses it, using term coverage, keyphrase matching, intent detection, and a 0-100 alignment score. It also lists the specific prompt terms the response missed.
    
    */Auto Prompt Extraction
    On supported platforms, the extension automatically detects your prompt from the page. No copy-paste needed.
    
    */Context-Aware Metrics
    Five of the 15 metrics now factor in platform and prompt context for deeper, more meaningful scoring.
    
    */Smarter Performance
    Debounced DOM scanning means the extension stays fast even during streaming responses.
    
    This started as a simple idea: what if you could score any AI output the way you'd audit a model in production?
    
    15 metrics. 6 categories. Accuracy, hallucination, groundedness, bias, robustness, drift, explainability, compliance, security, privacy, human override, reliability, latency, and business impact.
    
    All running locally in your browser.
    New:
    Same text as the current description in the About section.
  • Apr 16, 2026
    short_description
    Old: Measure 15 AI Assurance Metrics from any LLM output. Platform-aware scoring and prompt alignment. 100% local.
    New: Measure 15 AI Assurance Metrics from any LLM output. Inline annotations, mode presets, trend charts. 100% local.
  • Apr 16, 2026
    name
    Old: sidemind.ai - AI Assurance Metrics Analyzer
    New: MonkeyScore - AI Assurance Metrics Analyzer

Permissions & access

Permissions
activeTab, storage
Host access
None declared

About

sidemind.ai is a Chrome extension that measures 15 AI Assurance Metrics from any LLM output. 100% local. Zero data leaves your browser.

What's New in v2.2.2
🔍 Inline Sentence Annotations
Highlights problematic text directly inside LLM outputs — hallucinations (red), PII leaks (pink), security injections (dark red), bias/stereotypes (amber), speculative language (cyan), and vague attributions (indigo). Skips code blocks to avoid false positives. Toggle visibility with the eye icon in the results header.
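
As a rough illustration of the behavior described above, the sketch below maps each annotation category to its highlight color and skips fenced code before any detection runs. It is only a sketch under assumptions: the extension's real detectors and DOM handling are not documented here, so `Detector`, `splitOutCodeBlocks`, `annotate`, the category identifiers, and the exact CSS color values are hypothetical, and the code works on plain text with Markdown-style fences rather than on page elements.

```typescript
// Categories and colors as listed in the description. The identifiers and
// the exact CSS values are assumptions made for this sketch.
type Category =
  | "hallucination" | "pii" | "securityInjection"
  | "bias" | "speculative" | "vagueAttribution";

const HIGHLIGHT_COLORS: Record<Category, string> = {
  hallucination: "red",
  pii: "pink",
  securityInjection: "darkred",
  bias: "#ffbf00", // amber
  speculative: "cyan",
  vagueAttribution: "indigo",
};

interface Segment { text: string; isCode: boolean; }
interface Annotation { category: Category; color: string; text: string; }

// A detector is whatever logic flags problematic spans in prose (hypothetical).
type Detector = (prose: string) => { category: Category; text: string }[];

const FENCE = "`".repeat(3); // Markdown-style code fence marker

// Split text into prose and fenced-code segments.
// An unterminated fence is treated as prose in this sketch.
function splitOutCodeBlocks(text: string): Segment[] {
  const segments: Segment[] = [];
  let pos = 0;
  while (pos < text.length) {
    const open = text.indexOf(FENCE, pos);
    if (open === -1) break;
    const close = text.indexOf(FENCE, open + FENCE.length);
    if (close === -1) break;
    if (open > pos) segments.push({ text: text.slice(pos, open), isCode: false });
    const end = close + FENCE.length;
    segments.push({ text: text.slice(open, end), isCode: true });
    pos = end;
  }
  if (pos < text.length) segments.push({ text: text.slice(pos), isCode: false });
  return segments;
}

// Run the detector on prose only, so code never produces annotations.
function annotate(text: string, detect: Detector): Annotation[] {
  const annotations: Annotation[] = [];
  for (const segment of splitOutCodeBlocks(text)) {
    if (segment.isCode) continue;
    for (const hit of detect(segment.text)) {
      annotations.push({ ...hit, color: HIGHLIGHT_COLORS[hit.category] });
    }
  }
  return annotations;
}
```

Filtering out code segments before detection, rather than after, means a word such as "probably" inside a code sample cannot be flagged as speculative language.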
📈 Metric Trend Charts
New "Trends" view in the History tab. Filter by platform, see your overall score plotted over time as an SVG line chart, and browse a 15-card sparkline grid showing each metric's trajectory with ▲/▼/→ deltas from your previous session.
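
To make the sparkline and delta idea concrete, here is a minimal sketch of how one metric's history could be turned into SVG polyline points and a direction marker. It assumes scores on a 0-100 scale (the description states this only for the alignment score), and the function names `deltaSymbol`, `sparklinePoints`, and `sparklineSvg` are hypothetical, not taken from the extension.

```typescript
// Direction marker comparing the current session with the previous one.
function deltaSymbol(previous: number, current: number): "▲" | "▼" | "→" {
  if (current > previous) return "▲";
  if (current < previous) return "▼";
  return "→";
}

// Convert a series of scores (assumed 0-100) into "x,y x,y ..." coordinates
// suitable for the points attribute of an SVG <polyline>.
function sparklinePoints(scores: number[], width: number, height: number): string {
  if (scores.length === 0) return "";
  const stepX = scores.length > 1 ? width / (scores.length - 1) : 0;
  return scores
    .map((score, index) => {
      const clamped = Math.min(100, Math.max(0, score));
      const x = index * stepX;
      const y = height - (clamped / 100) * height; // SVG y grows downward
      return `${x.toFixed(1)},${y.toFixed(1)}`;
    })
    .join(" ");
}

// Wrap the points in a small standalone SVG string.
function sparklineSvg(scores: number[], width = 100, height = 20): string {
  const points = sparklinePoints(scores, width, height);
  return (
    `<svg xmlns="http://www.w3.org/2000/svg" width="${width}" height="${height}">` +
    `<polyline fill="none" stroke="currentColor" points="${points}"/></svg>`
  );
}
```

For example, `sparklinePoints([0, 50, 100], 100, 20)` yields `"0.0,20.0 50.0,10.0 100.0,0.0"`, and `deltaSymbol(70, 82)` yields `"▲"`.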
🎯 Analysis Mode Presets
Six scoring modes (General, Research, Developer, Compliance, Healthcare, and Marketing), each with a tailored weight profile. Mode weights combine with platform weights using an additive model (capped at 1.60×). The selected mode persists across sessions and shows as a badge in results and exported reports. Modes appear as selectable chips in both the popup and the on-page panel.
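
The description does not spell out the additive model, so the following is one plausible reading rather than the extension's actual formula: each weight is treated as a multiplier around 1.0, the two offsets from 1.0 are added, and the result is capped at 1.60. The names `combineWeight`, `combineProfiles`, and `WeightProfile`, and all example weights, are hypothetical.

```typescript
// Per-metric multipliers; 1.0 means "no adjustment". Values used with this
// type in examples are invented for illustration.
type WeightProfile = Record<string, number>;

const MAX_COMBINED_WEIGHT = 1.6; // the 1.60x cap stated in the description

// Assumed reading of "additive": add each weight's offset from 1.0
// instead of multiplying the two multipliers together.
function combineWeight(platformWeight: number, modeWeight: number): number {
  const combined = 1 + (platformWeight - 1) + (modeWeight - 1);
  return Math.min(MAX_COMBINED_WEIGHT, combined);
}

// Combine two profiles metric by metric; a metric missing from one profile
// is treated as having the neutral weight 1.0.
function combineProfiles(platform: WeightProfile, mode: WeightProfile): WeightProfile {
  const result: WeightProfile = {};
  const metrics = Object.keys(platform).concat(Object.keys(mode));
  for (const metric of metrics) {
    const platformWeight = metric in platform ? platform[metric] : 1;
    const modeWeight = metric in mode ? mode[metric] : 1;
    result[metric] = combineWeight(platformWeight, modeWeight);
  }
  return result;
}
```

Under this reading, a platform weight of 1.4 and a mode weight of 1.5 would sum to 1.9 and be capped at 1.6, whereas a multiplicative model would have produced 2.1 before any cap.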


Key features

*/Platform Auto-Detection
The extension automatically recognizes which AI platform you're using. Eleven major platforms are supported, and scoring adjusts accordingly.

*/Platform-Specific Weighted Scoring
Not all AI models fail the same way. Each platform now has a tailored scoring profile based on known failure modes: hallucination, groundedness, policy compliance, security, and more are weighted differently depending on where you're analyzing.
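
One simple way to picture platform-specific weighting is a weighted average over metric scores, where a platform profile raises the influence of the metrics tied to its known failure modes. The sketch below assumes exactly that; the actual scoring formula is not documented, and `weightedOverall` plus the example profile values are hypothetical.

```typescript
type MetricScores = Record<string, number>;  // metric name -> score
type PlatformProfile = Record<string, number>; // metric name -> weight

// Weighted average: sum(weight * score) / sum(weight).
// Metrics not listed in the profile get the neutral weight 1.
function weightedOverall(scores: MetricScores, profile: PlatformProfile): number {
  let weightedSum = 0;
  let totalWeight = 0;
  for (const metric of Object.keys(scores)) {
    const weight = metric in profile ? profile[metric] : 1;
    weightedSum += weight * scores[metric];
    totalWeight += weight;
  }
  return totalWeight === 0 ? 0 : weightedSum / totalWeight;
}

// Hypothetical profile for a platform where hallucination matters most.
const exampleProfile: PlatformProfile = { hallucination: 3 };
const exampleScore = weightedOverall({ hallucination: 50, security: 100 }, exampleProfile);
```

With these invented numbers, the low hallucination score counts three times as much as the security score, so `exampleScore` is 62.5 instead of the unweighted mean of 75.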

*/Prompt-Response Alignment
Provide the original prompt and the engine measures how well the output actually addresses it, using term coverage, keyphrase matching, intent detection, and a 0-100 alignment score. It also lists the specific prompt terms the response missed.
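
Of the signals named above, term coverage is the easiest to sketch. The example below computes the share of distinct prompt terms that reappear in the response, scaled to 0-100, and returns the missed terms. It is an assumption-laden illustration: keyphrase matching and intent detection are left out, and the stopword list, the `tokenize` rules, and the `termCoverage` name are hypothetical.

```typescript
// A tiny stopword list for illustration only.
const STOPWORDS = new Set([
  "a", "an", "the", "and", "or", "of", "to", "in", "for", "on", "with", "is", "are",
]);

// Lowercase, split on non-alphanumerics, drop stopwords and one-letter tokens.
function tokenize(text: string): string[] {
  const matches: string[] = text.toLowerCase().match(/[a-z0-9]+/g) || [];
  return matches.filter((term) => term.length > 1 && !STOPWORDS.has(term));
}

interface AlignmentResult {
  score: number;         // 0-100: share of prompt terms found in the response
  missedTerms: string[]; // prompt terms absent from the response
}

function termCoverage(prompt: string, response: string): AlignmentResult {
  const promptTerms = Array.from(new Set(tokenize(prompt)));
  const responseTerms = new Set(tokenize(response));
  if (promptTerms.length === 0) return { score: 0, missedTerms: [] };
  const missedTerms = promptTerms.filter((term) => !responseTerms.has(term));
  const covered = promptTerms.length - missedTerms.length;
  return { score: Math.round((covered / promptTerms.length) * 100), missedTerms };
}
```

For the prompt "Explain TCP handshake and congestion control" and the response "The TCP handshake uses SYN and ACK packets.", two of five prompt terms are covered, giving a score of 40 with `explain`, `congestion`, and `control` reported as missed.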

*/Auto Prompt Extraction
On supported platforms, the extension automatically detects your prompt from the page. No copy-paste needed.

*/Context-Aware Metrics
Five of the 15 metrics now factor in platform and prompt context for deeper, more meaningful scoring.

*/Smarter Performance
Debounced DOM scanning means the extension stays fast even during streaming responses.
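
Debouncing here means that a burst of DOM changes, such as tokens arriving during a streamed response, triggers one scan after the changes pause instead of one scan per change. The sketch below shows a trailing-edge debounce in general terms; it is not the extension's code. The `Scheduler` interface exists only so the timing can be swapped out in tests, and `analyzePage` in the trailing comment is a hypothetical function.

```typescript
// Minimal timer abstraction so the debounce can run with real or fake timers.
interface Scheduler {
  set(callback: () => void, delayMs: number): unknown;
  clear(handle: unknown): void;
}

const realTimers: Scheduler = {
  set: (callback, delayMs) => setTimeout(callback, delayMs),
  clear: (handle) => clearTimeout(handle as ReturnType<typeof setTimeout>),
};

// Trailing-edge debounce: each call cancels the pending run and schedules a
// new one, so fn runs once, waitMs after the last call in a burst.
function debounce(
  fn: () => void,
  waitMs: number,
  timers: Scheduler = realTimers
): () => void {
  let pending: unknown = undefined;
  let hasPending = false;
  return () => {
    if (hasPending) timers.clear(pending);
    hasPending = true;
    pending = timers.set(() => {
      hasPending = false;
      fn();
    }, waitMs);
  };
}

// Browser-only wiring, shown as a comment because it needs a DOM:
//   const scan = debounce(() => analyzePage(), 300); // analyzePage is hypothetical
//   new MutationObserver(scan).observe(document.body, {
//     childList: true, subtree: true, characterData: true,
//   });
```

The 300 ms wait in the comment is an arbitrary example value; a shorter wait feels more responsive but rescans more often while text is still streaming.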

This started as a simple idea: what if you could score any AI output the way you'd audit a model in production?

15 metrics. 6 categories. Accuracy, hallucination, groundedness, bias, robustness, drift, explainability, compliance, security, privacy, human override, reliability, latency, and business impact.

All running locally in your browser.

Technical

Version
2.2.3
Manifest
V3
Size
81.48KiB
Min Chrome
88
Languages
1
Featured
No

Metadata

ID
adkbefgpmjmpemmpoejhoicpbbamackp
Developer ID
uebde7338f38802522363dbf1aaf7e8ab
Developer Email
(address hidden by the source page)
Created
Feb 26, 2026
Last Updated (Store)
Apr 10, 2026
Last Scraped
Jun 9, 2026

Data sourced from the Chrome Web Store · last verified Jun 9, 2026.