Kommento Safety Index API – Sample Snapshot

A visual slice of the first few hundred domains our crawler processed. Each card shows the trust score, color tint, technical signals, and how Kommento would treat that host.

What we Collect in code :

At the code level, Kommento exposes a small Safety API surface for crawling and scoring the web, and returns structured records for each URL and host.

Kommento Safety API (developer surface)

shape: SafetyAPI

// High-level operations (pseudocode)

// Crawl & build the Safety Index
crawlWeb(seedList, options) {
  // fetch URLs, parse HTML, compute technical signals
  // write SafetyRecord rows into the Safety Index
}

// Get a full safety record for a URL
getSafetyRecord(url) => SafetyRecord {
  // returns trustScore, tint, signals, categories, reasonCode, timestamps
}

// Check if a URL is allowed for a given profile
isBlocked(url, profile) => boolean {
  // profile: "adult_default" | "teen" | "kid"
  // uses SafetyRecord + HostProfile + policy
}

// Get an aggregated host profile (what we store/license)
getHostProfile(domain) => HostProfile {
  // returns trust band, categories, defaultDecision, profileDecisions
}

This is the narrow “front door” for engineers: crawl, get a score, ask if something is blocked for a profile, or pull the aggregated host profile.

Per-URL record (raw SafetyRecord)

shape: SafetyRecord

{
  "url": "https://example.com/",
  "urlHash": "abcd1234…",
  "domain": "example.com",

  "trustScore": 93,
  "tint": "green",
  "riskTier": "low",

  "categoryMask": 0,            // bitfield: PORN, SCAM, etc.
  "signalMask": 8,
  "signals": ["MANY_THIRD_PARTY_SCRIPTS"],

  "trackerCount": 3,
  "thirdPartyScriptCount": 7,
  "isHttps": true,
  "hasMixedContent": false,
  "hasInsecureForm": false,

  "aiScore": 0,
  "modelVersion": 3,
  "reasonCode": null,
  "lastCheckedAt": "2025-12-05T17:40:28.832Z"
}

Each crawl produces a raw SafetyRecord: one row per URL with a trustScore, tint color, technical signals, categories, and an optional reason code.

What we Store & License?

From these raw per-URL records, Kommento builds stable HostProfiles: per-domain trust bands, categories, and ALLOW / WARN / BLOCK decisions for each profile. This is the dataset we can store, analyze, and license.

Per-host summary (HostProfile)

shape: HostProfile

{
  "domain": "example.com",
  "rootDomain": "example.com",

  "trustBand": "GREEN_HIGH",         // GREEN_HIGH, GREEN, LIGHT_GREEN, YELLOW, ORANGE, RED_LOW
  "riskTier": "low",                 // low, medium, elevated, high, blocked

  "contentCategories": [],           // e.g. ["PORN"]
  "securityCategories": [],          // e.g. ["MALWARE","PHISHING"]

  "defaultDecision": "ALLOW",
  "profileDecisions": {
    "adult_default": "ALLOW",
    "teen": "ALLOW",
    "kid": "ALLOW"
  },

  "primaryReason": "TECHNICAL_CLEAN",
  "secondarySignals": ["HTTPS_OK"],

  "firstSeenAt": "2025-12-05T17:40:28.832Z",
  "lastReviewedAt": "2025-12-05T17:40:28.832Z",
  "sourceFeeds": ["kommento_crawler"]
}

This aggregated HostProfile is what downstream products consume: a compressed, explainable verdict per domain that can power extensions, parental controls, ISPs, and security tools.

Trusted / Core Infrastructure

Big platforms and government sites scoring in the 90–100 band. These become automatic green “anchor” signals in Kommento’s SERP overlays and toolbars.

google.com

trustScore: 100

green signals: none

Tier-1 global search provider. Clean HTTPS, no mixed content; becomes a “trusted green” anchor in Kommento.

whitehouse.gov

trustScore: 100

green signals: none

Official U.S. government site. Automatically treated as highly trustworthy and highlighted for users.

stripe.com

trustScore: 100

green signals: none

Major payment processor. Clean technical posture; foundation for “safe to transact” hints in Kommento.

who.int

trustScore: 95

green signals: MANY_THIRD_PARTY_SCRIPTS

World Health Organization. High trust score, with minor technical noise from third-party scripts.

Mixed / “Watch This” Zones

Still generally safe, but our scanner sees mixed content or heavy third-party script use. These show up as yellow or light green in SERPs to draw the eye without scaring the user.

wordpress.org

trustScore: 85

lightgreen signals: MIXED_CONTENT

Open-source CMS. Good overall posture, but serves some assets over HTTP (mixed content) so we flag it as “slightly noisy”.

sourceforge.net

trustScore: 78

yellow MIXED_CONTENT MANY_THIRD_PARTY_SCRIPTS

Legacy download site. Our crawler sees mixed content plus many third-party scripts, so we downgrade to yellow as a visual “heads-up”.

typeform.com

trustScore: 75

yellow MIXED_CONTENT MANY_THIRD_PARTY_SCRIPTS

Form builder platform. Safe, but with enough embedded assets and mixed content that Kommento gently warns the user via a yellow tint.

exoclick.com

trustScore: 80

lightgreen MIXED_CONTENT MANY_THIRD_PARTY_SCRIPTS

Ad network. Technically acceptable but noisy from an ads/trackers standpoint. This is where future “privacy-aware” profiles can clamp down harder.

Blocked / High-Risk Zone

These are the sites Kommento would surface as red: porn, extreme tracking infrastructure, and eventually scams and malware. They all fall under 60 on the 0–100 scale.

upornia.com

trustScore: 20

red category: PORN signals: none

Explicit adult site. Detected via our porn host list and meta-content parser. Forced into the red band, ready for teen/kid profiles to auto-block.

adultfriendfinder.com

trustScore: 20

red category: PORN MANY_THIRD_PARTY_SCRIPTS

Adult dating / escort-style site. Classified as PORN and visually locked into red so it’s impossible to mistake for a safe result.

google-analytics.com

trustScore: 20

red MANY_TRACKERS

Tracking / analytics infrastructure. Not “malware”, but heavily classified as a tracker, so we push it into red for privacy-sensitive modes.

Unreachable / Error Cases

Error conditions (403, timeouts, certificate problems) are kept separate in a grey “no score” bucket. They don’t pollute trust scores but are still visible for diagnostics.

4chan.org

trustScore: —

grey HTTP 403 (forbidden)

The crawler could not fetch content (403). We treat this as an unreachable host rather than guess a score.

8chan.org

trustScore: —

grey Timeout

Connection timed out. Logged as an error so it’s visible in analytics, but excluded from trust scoring.

voat.co

trustScore: —

grey Certificate mismatch

TLS certificate does not match hostname. This is a strong technical red flag and is surfaced as a separate error row.