Blog

Honest about AI detectors: what our heuristic measures, and what it doesn't

Inksong shows a before-and-after AI-likelihood score. It's a heuristic — not a third-party guarantee. Here's exactly what's in it.

May 9, 2026 · 3 min read

title: "Honest about AI detectors: what our heuristic measures, and what it doesn't" description: "Inksong shows a before-and-after AI-likelihood score. It's a heuristic — not a third-party guarantee. Here's exactly what's in it." date: "2026-05-09" tag: "Product" author: "Inksong"

Inksong shows a 0–100 score before and after humanization. We label it as a heuristic on every page where it appears. We want to explain what's actually in it — both because it's good product hygiene and because the AI-detection space is full of overclaims that haven't aged well.

What the score is

A weighted sum of these signals:

  • Sentence-length variance (burstiness). AI-generated text trends toward uniform sentence lengths. Human writing varies more, especially across paragraph transitions. We measure the standard deviation of sentence-length-in-tokens across the document, normalized by document length.
  • Token-level repetition. Certain bigrams ("It is", "Furthermore,", "In conclusion,") appear with characteristic frequency in default-mode LLM output. We don't blacklist them — they're perfectly fine in human writing — but their density in a passage is a signal.
  • Hedge-word density. "Generally," "typically," "it can be argued that," "this suggests." LLMs over-hedge by default. Human writers hedge too, but with more variation in how they hedge (active subjects, specific authorities).
  • Vocabulary entropy. How predictable is each next word given the previous few? AI text is on average more predictable than human text in the same domain. We approximate this with a small in-house n-gram model trained on a mixed corpus.
  • Function-word distribution. The ratio of content-words to function-words ("the", "of", "in") behaves slightly differently in AI vs human writing across domains.

We sum these into a 0–100 number. Higher means more AI-like under our heuristic.

What the score is not

  • It is not Turnitin. It's not GPTZero. We didn't run those services in a back-test and we don't claim parity. They use proprietary classifiers and they change all the time.
  • It is not a guarantee that an academic-integrity tool won't flag your document. Detection tooling is fast-moving and noisy. If you're submitting work where a third-party detector is the gating mechanism, you should ultimately rewrite by hand and review.
  • It is not a measure of writing quality. A well-written human academic paper can score moderately high on burstiness uniformity. A badly-written human email can score low. The score measures resemblance to default-mode AI generation, not whether the prose is good.
  • It is not trained on your specific use case. We don't fine-tune the detector per user. Domain-aware detection is on the roadmap; right now the same heuristic applies whether you're writing legal opinions or marketing copy.

Why a heuristic at all?

Because the feedback loop matters. If you humanize a document and have no idea whether it moved the needle, you can't iterate. A small in-house score gives you that signal cheaply and instantly. We'd rather show an imperfect number with full disclosure than send you to a third-party tool that's also imperfect and charges for it.

When we have published comparisons against third-party detectors and a domain-aware version of the score, we'll write about it here.

How to read the number, honestly

  • A drop from 85 → 25 is real signal that the document reads more naturally on our axes.
  • That doesn't mean third-party detectors will agree.
  • If you're optimizing for a specific detector, the right workflow is: humanize with Inksong, run your specific detector externally, iterate on the parts it flags.

If we ever start claiming Inksong scores guarantee third-party bypass, tell us to stop. We'd rather lose the sale than make a promise we can't keep.

Start humanizing today

5 documents free a month, no card needed. Three minutes to your first humanized doc.

  • 5 documents/month on the free tier
  • No credit card required
  • Cancel or upgrade anytime