№ 01 / VOL. 2026 Research Memo · Tuesday · May 19 · 2026

A Data-Driven Playbook · Oriane Research

How AI Founders Win
on Video

An empirical look at 8,758 AI-themed videos posted to TikTok and Instagram between February and May 2026. We examine which structural choices - duration, pacing, vocabulary, hook, platform - correlate with engagement, and which do not.

Feb 19 - May 15, 2026 TikTok + Instagram 3,764 founders & creators Spoken-word transcripts

8,758

Videos
Analyzed

1.04B

Total
Views

51.8M

Total
Engagements

5.17%

Avg
Engagement Rate

3,764

Unique
Creators

483

Videos in
Shadow Reach

Executive Summary

An empirical view of how AI founders use short-form video.

The Lede Why this report exists

Between February 19 and May 15, 2026, Oriane's multimodal pipeline identified 8,758 AI-, founder-, and product-themed videos across TikTok and Instagram, accounting for roughly 1.04 billion views in the English-language subset. The dataset is large enough to surface structural patterns - on hook construction, duration, pacing, and platform behaviour - that are not yet well covered in founder-facing literature. This report sets out the most consistent of those patterns, and is candid about which findings are causal and which remain correlational.

FINDING 01

The "Hi, my name is" opener underperforms

Videos opening with a self-introduction ("Hi, my name is…", "Hello, I'm…") record the lowest mean engagement rate of any hook category we classified: 4.33% vs. 5.17% across all hooks. The instinctive opener that founders use on stage is the same one the algorithm appears to penalise on short-form video.

n = 235 videos · 4.33% ER · −16% vs baseline

FINDING 02

Faster delivery correlates with higher engagement

The relationship across pace buckets is monotonic. Videos delivered at 210+ words per minute record a mean ER of 5.57%; videos under 90 wpm record 3.52%. A measured, deliberate cadence - what we might call the "boardroom voice" - appears to underperform a faster, conversational one on this dataset.

n = 2,447 (fast) vs 68 (slow) · +58% ER gap

FINDING 03

Short captions outperform long captions

Videos with captions under 50 characters record a mean ER of 6.69%. Captions over 500 characters record 4.12%. The relationship is monotonic across all five buckets we tested. One interpretation: long captions read as paid or promotional, even when they are not; concise captions read as native.

n = 533 (short) vs 1,472 (long) · +62% ER gap

Monthly volume & reach

Feb–May 2026

Feb 2026

752

Mar 2026

2,610

Apr 2026

2,148

May 2026 (partial)

1,028

March recorded the highest video volume; April recorded the highest total views (574M), suggesting some of the March content compounded into April distribution. May figures are partial (data through May 15).

●

The Oriane Shadow Reach Layer: 483 videos in this dataset mention major AI labs (OpenAI, Anthropic, Google AI) only in spoken audio - never in a caption or hashtag. Those 483 videos accounted for 70.9 million views, none of which would surface in a text-based listening tool such as Brandwatch, Meltwater or Talkwalker. The shadow layer is not the headline of this report, but it is the methodological note worth flagging.

For OpenAI / ChatGPT specifically: 191 spoken-only videos accounting for 36.2M views. For Anthropic / Claude: 172 spoken-only videos · 24.0M views. Text-based competitive monitoring would miss roughly a third of the surface area on either brand.

The Top 1% Profile

What the highest-engagement 1% have in common

The top 1% of videos in this dataset (n=65) post a mean engagement rate of 34.4% against a dataset baseline of 5.17%. They share three structural traits in common.

↑ Top 1% (n=65)

96s

Median Duration

Long enough to demonstrate a product, short enough to retain attention. 86% are posted on TikTok. Median follower count: 233K.

↓ Everyone else

81s

Median Duration

Slightly shorter, materially less engaging. 44% TikTok. Median follower count: 88K. Pace (197 wpm) is nearly identical to the top 1%.

The platform mix is the largest single separator: 86% of top-1% videos sit on TikTok versus 44% of the rest. Pace, jargon density, and word count are nearly identical between cohorts. The data suggests that what founders are saying matters less than where they choose to say it.

02 · The Hook

The opening line: which hook archetypes travel.

We classified the opening words of every English-language video in the dataset into nine hook archetypes. The performance gap between the best- and worst-performing hook is roughly 35 percentage points in mean ER. The most common opener used by founders ("self-introduction") sits in the bottom third.

Hook archetype performance · avg ER%

English subset · n=6,538

Direct address ("You…")

5.83%

Story ("When I was…")

5.81%

Contrarian claim

5.70%

Announcement ("Launching…")

5.46%

Showcase ("Watch this…")

5.37%

Question ("What if…")

5.37%

"Other" / mixed

4.96%

Problem-first ("The problem is…")

4.75%

Self-intro ("Hi, my name is…")

4.33%

"We built…" / "I built…"

4.18%

Hooks framed around the viewer ("you") outperform hooks framed around the founder ("I built…", "Hi, my name is…") by 35–39% in mean ER. "The problem is…" - a common framing in pitch coaching - sits in the bottom third. The data suggests that what works in a pitch room does not transfer directly to short-form video.

Examples from the dataset

Real openings, real performance, pulled directly from the dataset's outliers.

"Stop writing your own prompts. Instead, let AI do it for you. Most people sit there spending 15, 20 minutes…"

@chriswinfield · Instagram · 138s ↑ 100% ER · top-1%

"Most people don't realize that Codex just dropped a massive new update that tons of people are looking…"

@ray_fu · TikTok · 65s ↑ 11.2% ER · 10.4M views

"This is the AI update I have been waiting for since I was 12. OpenAI just dropped a coding agent…"

@michellescomputer · TikTok · 88s ↑ 9.9% ER · 7.5M views

"Hi, my name is Valerie. I'm a fashion content creator. I was a data scientist, and then I started to…"

@valerie_yyy · TikTok ↓ Strong views, low ER 0.26%

"Hi everyone. So today I want to walk you through our product, which is an AI agent that helps with…"

Composite of the self-intro hook pattern (bottom quartile) ↓ Self-intro + soft preview = lowest-performing cluster

The pattern across the highest-performing openers is consistent: they make a claim that contradicts a conventional assumption, name the viewer rather than the founder, or promise something immediate. Founders trained for credentialed introductions appear to be operating with a different frame than the platform rewards.

From the data - Top 1% hook patterns

Specific phrase performance · ER%

Specific phrase test

"Watch this…"

7.64%

"In 30 seconds…"

6.38%

"Let me show you…"

6.25%

"We built…" / "I built…"

5.72%

"This is the…"

5.72%

"You should…" / "You need to…"

5.67%

"Imagine…"

5.47%

"The problem is…"

5.46%

Among the three opener phrasings we tested directly: "Watch this" records the highest mean ER (7.64%), ahead of "The problem is" (5.46%) and "We built" (5.72%). The gap between the best and worst of these three is roughly 40%.

03 · Anatomy

Length, pace, and content type.

Three structural variables - video duration, delivery speed (words per minute), and content theme - account for a substantial share of the engagement variance in this dataset. We examine each in turn.

Duration sweet spot · ER by length

All platforms · n=8,758

3–5 min

7.25%

90s–3 min

6.05%

5+ min

5.09%

60–90s

5.12%

Under 15s

4.97%

30–60s

4.22%

15–30s

4.19%

The "shorter is always better" heuristic does not hold in this dataset. The 90s–3min band records the strongest mean ER (6.05%); sub-30-second clips underperform (4.19% at 15-30s). One reading: audiences for technical content appear to self-select for longer-form engagement.

Words per minute · monotonic relationship

Higher pace = higher engagement

210+ wpm (n=2,447)

5.57%

180–210 wpm

5.26%

150–180 wpm

4.80%

120–150 wpm

4.32%

90–120 wpm

4.13%

Under 90 wpm (n=68)

3.52%

For reference: conversational English averages around 130 wpm and audiobooks target roughly 150. The top-performing AI founder videos in this dataset run at 200–235 wpm - closer to a podcast guest in flow than a keynote register. A measured cadence with deliberate pauses appears to underperform on short-form video, though we cannot rule out that "fast pace" is itself a proxy for higher production energy.

Content theme volume vs engagement

English subset · ER%

Fundraising

6.07%

Product demo

5.98%

Tool review

5.63%

Founder story

5.47%

Launch / "We just shipped"

5.15%

Tutorial / "how to"

5.08%

Industry take / prediction

5.05%

Coding / "vibe coding"

4.99%

Workflow automation

4.64%

Agentic AI / "AI agents"

4.31%

A note on the "AI agents" theme: it has the lowest mean ER of any major theme we tested in this dataset (4.31%). One interpretation is audience fatigue around abstract framing; another is that "agents" content tends to be more conceptual and less product-specific, and concrete demos appear to engage more reliably. We flag this as a finding worth replicating before drawing conclusions.

If we aggregate the top-decile traits into a single composite, the highest-engagement video in this dataset is on TikTok, runs roughly 110 seconds, opens with a contrarian or instructive line ("stop doing X", "watch this"), is delivered at 210+ words per minute, uses a caption under 50 characters, and closes without an explicit call to action.

The composite top-decile profile, derived from 8,758 videos

Two structural signals worth highlighting

↑ Short captions (under 50 characters)

6.69%

Average engagement rate

n = 533 videos. The relationship is monotonic across all five caption-length buckets we tested. A concise caption appears to read as native; a long one appears to read as promotional, even when it isn't.

↓ Long captions (500+ characters)

4.12%

Average engagement rate

n = 1,472 videos. The temptation to write a "mini essay" caption to add context appears to come with an engagement cost. A 62% gap separates the shortest and longest cohorts.

↑ No explicit CTA

5.24%

Average ER · 5,185 videos

Videos that close without asking for anything. One reading: the algorithm appears to weight non-promotional content more favourably, regardless of intent.

↓ Any explicit CTA

4.90%

Average ER · 1,353 videos

"Link in bio," "Try it free," "Sign up." Each phrase correlates with a 6–18% reduction in mean ER vs the no-CTA baseline. The "link in bio" formulation carries the largest penalty.

Both signals point in the same direction: content that looks and reads less like marketing performs better. There is an obvious causal story (audiences and algorithms both prefer non-promotional content) but also a confounding one (creators who happen to use shorter captions and no CTAs may simply be making different kinds of content). The data shows the pattern; the interpretation is contestable.

04 · Language and Vocabulary

Technical depth and its trade-offs.

We tagged the spoken transcript of every English-language video for technical jargon - agentic, multimodal, transformer, embeddings, inference, RAG, fine-tuning, latency, and similar terms - and counted occurrences. The relationship between jargon density and engagement is non-monotonic: a small amount is neutral or mildly positive; higher volumes are associated with lower engagement.

Jargon density · ER% by count of technical terms

English subset · n=6,538

1 jargon term

5.42%

Zero jargon

5.15%

2–3 terms

4.58%

4–6 terms

3.63%

7+ terms

2.37%

One technical term outperforms zero (5.42% vs 5.15%). Two or three already sit below baseline. From four upwards, the relationship turns sharply negative. A useful working principle drawn from the data: introduce at most one technical concept per video, and translate it explicitly. Each additional term beyond the first correlates with measurably lower engagement.

Accessibility phrases · "imagine," "it's basically," "in plain English"

Translation language wins

2+ accessibility phrases

6.02%

1 phrase

5.64%

Zero

5.10%

Videos that contain explicit translation phrases ("think of it like…", "in plain English…", "this basically means…") outperform videos that don't by roughly 18% in mean ER. This is one of the larger effect sizes we recorded for a single linguistic variable.

"AI washing" is real but solvable. The data shows audiences don't punish founders for using AI terminology - they punish founders for using AI terminology without explaining it. Say "agentic" once, then say what it means. Say it five times, lose the audience.

Implication for AI startup positioning

Which AI labs the conversation references

When AI founders make videos, they don't operate in a vacuum - they reference the platforms shaping the conversation. Here is the share of voice across the dataset.

AI brand co-mentions · videos detected

Spoken + caption · English subset

Anthropic / Claude

636

OpenAI / ChatGPT

624

Google / Gemini

186

Y Combinator

164

xAI / Grok

Cursor

Perplexity

Lovable

Runway

Microsoft / Copilot

Meta / Llama

Notion AI

Two labs dominate the named-mention conversation in this dataset: Anthropic/Claude (636 videos) and OpenAI/ChatGPT (624). xAI/Grok records 51 mentions despite high mainstream visibility for its leadership. The substantive AI conversation on TikTok and Instagram, by volume of mentions, is largely a conversation about two companies.

●

Shadow reach in this dataset: 191 of the 624 OpenAI / ChatGPT mentions appear only in spoken audio - the brand is named in the video but never written in the caption, hashtag, or @mention. For Anthropic / Claude: 172 of 636 mentions are spoken-only.

For a brand running competitive monitoring exclusively via caption-based listening tools, this represents roughly a third of the actual mention surface area in the dataset. The implication is methodological: any market-share or share-of-voice estimate based on text scraping alone will be a meaningful undercount for AI brands.

05 · Platform

TikTok and Instagram behave very differently.

The dataset splits roughly 55 / 45 between Instagram and TikTok. The engagement-rate gap between the two platforms is the largest single performance delta in this analysis - material enough that conclusions about content type or hook archetype need to be read in conjunction with platform.

TIKTOK

7.32%

Average engagement rate

3,926 videos · 869M views · median duration 90s. TikTok records 2.1x Instagram's mean engagement rate in this dataset, across a similar volume of content.

INSTAGRAM

3.42%

Average engagement rate

4,832 videos · 1.2B views · median duration 78s. Lower engagement per view but materially higher total reach. The platform appears to function more as a distribution surface than a community one for this category.

Duration behaves differently on each platform

TikTok · ER by duration bucket

Long form wins

3–5 min

8.78%

90s–3 min

7.98%

Under 15s

7.38%

60–90s

7.29%

15–30s

6.68%

30–60s

6.01%

5+ min

3.66%

TikTok now rewards substantial duration for AI/founder content. The 90s–5min range records the strongest mean ER on the platform - 7-12x longer than the 15-second clips TikTok built its reputation on. The platform's audience for technical content does not appear to behave like a short-attention-span audience.

Instagram · ER by duration bucket

Length still matters, less dramatically

5+ min

5.23%

3–5 min

4.18%

90s–3 min

4.18%

60–90s

3.49%

30–60s

3.19%

Under 15s

3.11%

15–30s

2.95%

Instagram's engagement curve by duration is flatter than TikTok's but skews slightly more toward longer formats. The Reels-as-short-form-clone era appears to be over for technical content in this dataset - sub-15-second clips meaningfully underperform 90s+ videos on both platforms.

Platform-specific patterns

TikTok: the strongest duration band is 90 seconds to 3 minutes

The median duration among top-1% TikTok videos is 100 seconds. The most-engaging pattern we observed is a fast opening hook, a concrete demonstration of 60–90 seconds, and a close without an explicit CTA. Talking-head footage layered with screen recording is the dominant winning format.

TikTok mean ER 7.32% · 86% of top 1% are on TikTok

Instagram: higher total reach, lower per-view engagement

Instagram accounts for 40% more total views than TikTok (1.2B vs 869M) on a similar content volume, but at roughly half the mean engagement rate. The data is consistent with using Instagram as a top-of-funnel surface - for awareness and broader distribution - while running deeper engagement and community-building on TikTok.

IG: 1.2B views · 3.42% mean ER

The platforms behave less like substitutes than like distinct channels with distinct functions. TikTok appears to be the better channel for comments, saves, and audience depth; Instagram appears to be the better channel for total reach. The same video, cross-posted without adaptation, will likely underperform its potential on at least one of the two.

A working hypothesis from the data

06 · Creators

Follower count is a weaker predictor than expected.

3,764 unique creators appear in this dataset. The follower-tier breakdown shows a non-monotonic relationship: mid-tier accounts (100K–1M followers) outperform macro accounts (1M+) on mean engagement, and a meaningful share of the highest-engagement videos come from accounts with under 50K followers.

Follower tier performance · ER%

English subset · n=6,538

Nano (1K–10K)

5.43%

Mid (100K–1M)

5.34%

Micro (10K–100K)

5.10%

Macro (1M+)

4.77%

Sub-1K

2.59%

Mean engagement decreases monotonically with follower size above 10K, with the sub-1K tier behaving anomalously (likely a distribution effect - very small accounts struggle to reach a meaningful audience at all). The 100K–1M tier records the strongest mean ER above the 10K floor, which on this evidence is the band worth examining most closely for partnerships.

Top 12 creators by total reach in this dataset

Sorted by views, 2+ videos

Creator	Platform	Videos	Total Views	Followers	Avg ER
@amazonwebservices	IG	9	49.3M	786K	0.80%
@dope.motions	IG	4	22.4M	779K	2.71%
@tiffintech	IG	13	12.9M	626K	5.97%
@nytimes	IG	4	12.9M	20.0M	5.06%
@ray_fu	TT	24	12.3M	126K	7.32%
@mytechceo	IG	7	10.2M	252K	4.62%
@vaibhavsisinty	IG	25	9.9M	1.6M	3.50%
@thevamshikurapati	IG	15	9.6M	912K	4.48%
@pikacodes	TT	2	7.1M	192K	3.26%
@manojsaru	IG	2	6.2M	2.1M	9.24%
@socho.abhi	IG	9	5.6M	307K	6.50%
@brycent	IG	65	6.0M	97K	3.95%

The top of the reach leaderboard is largely populated by accounts that cover AI and startup content (creators, news brands, AI commentators) rather than by founders themselves. Founder accounts tend to appear lower on the reach list but record higher mean engagement rates, consistent with the broader tier finding.

Mid-tier creators with above-average engagement

High engagement, mid-size following, multiple videos in our window. These are the creators a startup should be paying attention to before everyone else does.

High-ER mid-tier creators · 3+ videos, 10K+ engagement signal

Partnership shortlist

Creator	Platform	Videos	Total Views	Followers	Avg ER
@realestrhxtt	TT	3	1.1M	132K	40.77%
@adivunsolicited	TT	3	328K	221K	26.19%
@thedreydossier	TT	3	291K	484K	19.34%
@thotline_news	TT	4	138K	98K	19.12%
@chriswinfield	IG	14	471K	330K	18.69%
@hemu_rahman	TT	12	2.3M	118K	18.03%
@davejorgenson	TT	3	652K	175K	16.14%
@jthefrog0	TT	6	4.0M	327K	15.34%
@heyroyall	TT	3	2.6M	187K	14.15%
@longliveai	IG	17	11.8M	130K	12.84%

All but one of the top-10 mid-tier creators in this view are on TikTok. Engagement rates of 15-40% are unusually high and likely reflect a tight content-audience fit rather than something replicable through paid amplification - these accounts have built their audiences around AI/founder commentary specifically.

The AI brands that will define 2026 are not the ones with the biggest social agencies. They are the ones that figured out which 100-creator partnership map gets them inside the loops where AI is actually being talked about.

Strategic implication for AI startup distribution

07 · The Playbook

Seven structural observations, drawn directly from the data.

Each item below corresponds to a specific finding earlier in the report. The framing is observational, not prescriptive: these are the patterns we found in the data, with the caveat that any single video may succeed by ignoring all of them.

The self-introduction opener is a weak hook

Videos opening with "Hi, my name is…" or similar self-identification record a mean ER of 4.33% - 16% below the dataset average and 26% below the best-performing hook archetype (direct address). Founders who recognise their viewers can read their account handle and bio appear to use that screen space for something else: a contrarian claim, a piece of news, or a direct address to the viewer.

Self-intro hooks: 4.33% ER · Direct address: 5.83% ER · +35%

"Watch this" outperforms "We built…"

The specific phrase "Watch this" appears as an opener in 312 videos and records a mean ER of 7.64% - roughly 48% above the dataset average. The variant "We built…" records 5.72%. A reasonable interpretation: "watch this" promises immediate value to the viewer; "we built" centres the creator. The first frame appears to be where this distinction matters most.

"Watch this": 7.64% ER vs "We built": 5.72% ER · +34%

III

Faster pace correlates with higher engagement

The relationship between words-per-minute and ER is monotonic across the six buckets we tested. Videos delivered at 210+ wpm record 5.57% mean ER; videos under 90 wpm record 3.52%. The top 1% of videos average 196 wpm - closer to podcast tempo than to a keynote delivery. Whether this reflects audience preference, algorithmic preference, or a correlated variable (energy, editing) is unclear from the data.

210+ wpm: 5.57% ER · Under 90 wpm: 3.52% ER · +58%

The 90-second to 3-minute window outperforms shorter formats

On both platforms, videos in the 90s–3min duration band record higher mean ER than sub-30-second clips. On TikTok, the 3–5min band performs best (8.78% ER). The "shorter is always better" heuristic does not hold in this dataset for AI/founder content; one plausible reading is that audiences for technical material self-select for longer engagement.

90s–3min: 6.05% ER · 15-30s: 4.19% ER · +44%

One technical term works; multiple do not

Videos using exactly one technical term ("agentic", "RAG", "multimodal", "fine-tuning") slightly outperform videos with none (5.42% vs 5.15%). Each additional term beyond the first correlates with a decrease in mean ER, reaching 2.37% at seven or more terms. A useful proxy: introduce the term once, then translate it with a phrase like "think of it like…" - videos using two or more such translation phrases record 6.02% ER vs 5.10% for videos with none.

1 term + translation language: optimal · 7+ terms: 2.37% ER

Explicit calls-to-action correlate with lower engagement

Videos without an explicit CTA record 5.24% mean ER; videos with one record 4.90%. The phrase "link in bio" carries the largest individual penalty (4.08% ER, n=412). The causal direction is contestable - content that includes a CTA may also tend to be more transactional in tone overall - but the correlation across phrasings is consistent.

No CTA: 5.24% ER · "Link in bio": 4.08% ER · −22%

VII

TikTok and Instagram serve different objectives

TikTok records 2.1x the mean engagement rate of Instagram (7.32% vs 3.42%) in this dataset. Instagram, however, accounts for 40% more total views (1.2B vs 869M) across a similar volume of content. For founders, the implication is operational: the two platforms behave less like substitutes than like distinct channels with different functions - engagement on one, awareness on the other.

TT ER 7.32% · IG ER 3.42% · IG views +40% over TT

Aggregated across 8,758 videos, the patterns above point in one consistent direction. The format rewards directness, brevity in caption, depth in delivery, and an unscripted register. None of these are revelations on their own; the contribution of the dataset is that all four show up together, with reasonable sample sizes, across two platforms and three months of content.

Closing observation

RESEARCH METHODOLOGY

Multimodal video intelligence

8,758 qualified videos analysed across TikTok and Instagram

Engagement rate recomputed from raw counts. Hook archetypes classified by transcript pattern. Shadow reach detected via spoken-word audio analysis. Methodology and analysis prompt available via the buttons below.

Run this analysis on any vertical, brand,
or creator cohort with Oriane.

Book a demo →

oriane.xyz

An empirical view of how AI founders use short-form video.

The "Hi, my name is" opener underperforms

Faster delivery correlates with higher engagement

Short captions outperform long captions

What the highest-engagement 1% have in common

The opening line: which hook archetypes travel.

Examples from the dataset

Length, pace, and content type.

Two structural signals worth highlighting

Technical depth and its trade-offs.

Which AI labs the conversation references

TikTok and Instagram behave very differently.

Duration behaves differently on each platform

Platform-specific patterns

TikTok: the strongest duration band is 90 seconds to 3 minutes

Instagram: higher total reach, lower per-view engagement

Follower count is a weaker predictor than expected.

Mid-tier creators with above-average engagement

Seven structural observations, drawn directly from the data.

The self-introduction opener is a weak hook

"Watch this" outperforms "We built…"

Faster pace correlates with higher engagement

The 90-second to 3-minute window outperforms shorter formats

One technical term works; multiple do not

Explicit calls-to-action correlate with lower engagement

TikTok and Instagram serve different objectives

The Nine Searches

The Analysis Prompt