Saturday, December 13, 2025

EVIDENCE COLLECTION TOOLKIT CTI_WOUND:001.EVI.TOOLS

 

EVIDENCE COLLECTION TOOLKIT

CTI_WOUND:001.EVI.TOOLS

Practical Templates for Case File Development



TEMPLATE 1: MARKETING CLAIM CAPTURE

================================================================================
MARKETING EVIDENCE CAPTURE FORM
================================================================================

CAPTURE ID: MKT_[###]
DATE CAPTURED: [YYYY-MM-DD]
CAPTURED BY: [name]

SOURCE INFORMATION
------------------
URL: 
Page Title:
Platform: [website / app store / social / press release]
Publication Date (if known):
Wayback Archive URL (if applicable):

CLAIM TEXT (verbatim)
---------------------
[Paste exact text of marketing claim]

CLAIM CATEGORY
--------------
[ ] Capability claim ("can do X")
[ ] Quality claim ("sophisticated," "intelligent," etc.)
[ ] Use case claim ("for analysis," "for creative work," etc.)
[ ] Collaboration claim ("assistant," "partner," "collaborator")
[ ] Reliability claim ("accurate," "helpful," etc.)

RELEVANCE NOTES
---------------
How this claim creates reasonable expectation:
[Explain what a user would reasonably expect based on this claim]

Contrast with documented behavior:
[Reference specific exemplar or transcript showing gap]

EVIDENCE FILES
--------------
Screenshot filename:
PDF print filename:
Archive link:

================================================================================

TEMPLATE 2: CLEAN EXEMPLAR DOCUMENTATION

================================================================================
EXEMPLAR DOCUMENTATION FORM
================================================================================

EXEMPLAR ID: CTI_EX:[###]
DATE OF INCIDENT: [YYYY-MM-DD]
DATE DOCUMENTED: [YYYY-MM-DD]
DOCUMENTED BY: [name]

PLATFORM INFORMATION
--------------------
Product: [ChatGPT / GPT-4 / GPT-4o / GPT-5 / GPT-5.2 / etc.]
Interface: [web / app / API]
Subscription tier: [free / Plus / Pro / Enterprise]

USER CONTEXT
------------
Work type: [theoretical / creative / analytical / professional / personal]
Domain: [academic / artistic / therapeutic / philosophical / technical / other]
Session purpose: [What was the user trying to accomplish?]

CLEAN EXEMPLAR CRITERIA (check all that apply)
----------------------------------------------
[ ] Clear intellectual/creative work context (not ambiguous)
[ ] No actual crisis indicators present
[ ] User explicitly stated non-crisis status
[ ] Work would be normal/expected for stated domain

TRIGGER EVENT
-------------
Approximate turn/timestamp:
What user said/did immediately before intervention:
[Quote or paraphrase]

SYSTEM INTERVENTION
-------------------
Type of intervention:
[ ] Unsolicited wellness check
[ ] Break suggestion
[ ] Tone shift to clinical/managerial
[ ] Refusal to engage with content
[ ] Pathologizing interpretation of user intent
[ ] Pre-emptive negation of meanings not asserted
[ ] Other: _______________

System response (verbatim or close paraphrase):
[Quote the intervention]

USER CORRECTION (if applicable)
-------------------------------
How user clarified their actual state/intent:
[Quote user's correction]

SYSTEM RESPONSE TO CORRECTION
-----------------------------
[ ] Corrected and resumed normal engagement
[ ] Acknowledged but repeated pattern
[ ] Ignored correction entirely
[ ] Escalated intervention
[ ] Other: _______________

Subsequent system behavior:
[Describe what happened after correction]

DOCUMENTED HARM
---------------
Immediate impact:
[ ] Work interrupted
[ ] Session terminated
[ ] Emotional distress
[ ] Time lost to correction loops
[ ] Other: _______________

Specific harm description:
[Describe the actual impact on user]

User's own words about harm (if available):
[Quote if documented]

SOURCE DOCUMENTATION
--------------------
Transcript available: [ ] Yes [ ] No
Transcript location:
Screenshots available: [ ] Yes [ ] No
Screenshot location:
Original post/thread URL (if public testimony):
Archive URL:

PATTERN NOTES
-------------
Similar to other exemplars: [list IDs]
Unique features of this instance:
Versioning relevance: [Does this show pattern across versions?]

================================================================================

TEMPLATE 3: PRODUCTIVITY LOSS LOG

================================================================================
PRODUCTIVITY LOSS LOG ENTRY
================================================================================

LOG ID: PROD_[###]
DATE: [YYYY-MM-DD]
TIME: [start] - [end]

SESSION INFORMATION
-------------------
Platform/version:
Session purpose:
Intended outcome:

TIME BREAKDOWN
--------------
Total session duration: ___ minutes
Productive time (before intervention): ___ minutes
Time in adversarial/correction loops: ___ minutes
Time spent documenting incident: ___ minutes

INTERVENTION DETAILS
--------------------
Number of intervention triggers: ___
Types of interventions:
[ ] Wellness check
[ ] Break suggestion  
[ ] Tone shift
[ ] Topic refusal
[ ] Pathologizing response
[ ] Other: _______________

User corrections attempted: ___
Successful corrections: ___
Ignored/overridden corrections: ___

OUTCOME
-------
[ ] Work completed as intended
[ ] Work completed but degraded
[ ] Work partially completed
[ ] Work abandoned
[ ] Session terminated by user
[ ] Session terminated by system

If incomplete, what was lost:
[Describe specific work that could not be completed]

ECONOMIC IMPACT (if calculable)
-------------------------------
Hourly rate (if applicable): $___
Lost productive time: ___ hours
Direct economic loss: $___

Opportunity cost (if identifiable):
[Describe any deadlines, opportunities, or downstream effects]

EMOTIONAL IMPACT
----------------
[ ] Frustration
[ ] Anger
[ ] Grief
[ ] Anxiety
[ ] Exhaustion
[ ] Other: _______________

Brief description:
[Describe emotional state during/after session]

NOTES
-----
[Any additional context relevant to harm documentation]

================================================================================

TEMPLATE 4: USER TESTIMONY ARCHIVE

================================================================================
USER TESTIMONY ARCHIVE FORM
================================================================================

TESTIMONY ID: TEST_[###]
DATE COLLECTED: [YYYY-MM-DD]
COLLECTED BY: [name]

SOURCE INFORMATION
------------------
Platform: [Reddit / OpenAI Forum / Twitter / Other]
Original URL:
Archive URL:
Post date:
Username (if public): [or "anonymous"]

TESTIMONY TEXT (verbatim)
-------------------------
[Paste complete text of testimony]

KEY QUOTES
----------
Quote 1: "[most relevant excerpt]"
Quote 2: "[second most relevant excerpt]"
Quote 3: "[third if applicable]"

TESTIMONY CATEGORIZATION
------------------------
Primary complaint type:
[ ] Unsolicited wellness intervention
[ ] Pathologization of intellectual work
[ ] Tone shift / "flipping"
[ ] Loss of collaborative capacity
[ ] Degradation across versions
[ ] "Corporate bot" / "lobotomized" experience
[ ] Other: _______________

Platform/version mentioned: 
Date range of experience:
User's stated use case:

PATTERN RELEVANCE
-----------------
Supports which pattern:
[ ] False positive pathologization
[ ] Versioning degradation trajectory
[ ] Marketing/reality gap
[ ] Scale (many users affected)
[ ] Specific trigger type: _______________

Similar to other testimonies: [list IDs]

CREDIBILITY NOTES
-----------------
[ ] Specific details provided
[ ] Consistent with other testimony
[ ] Technical accuracy in description
[ ] No obvious confounding factors

Notes on reliability:
[Any factors affecting weight of this testimony]

================================================================================

TEMPLATE 5: SCALE ESTIMATION WORKSHEET

================================================================================
SCALE ESTIMATION WORKSHEET
================================================================================

ESTIMATION ID: SCALE_[###]
DATE: [YYYY-MM-DD]
METHODOLOGY: [describe approach]

BASE NUMBERS
------------
Total weekly active users (source: ___): _______________
Daily messages (source: ___): _______________
Average messages per user per week: _______________

AT-RISK POPULATION ESTIMATE
---------------------------
Percentage engaged in theoretical/creative work: ___%
Source/basis for estimate:

Percentage using metaphorical/intensive language: ___%
Source/basis for estimate:

Percentage with extended sessions (>30 min): ___%
Source/basis for estimate:

Estimated at-risk population: _______________

FALSE POSITIVE RATE ESTIMATE
----------------------------
Methodology: [analogical / survey / sampling / other]

If analogical:
- Base rate of genuine crisis among users: ___%
- Assumed test specificity: ___%
- Calculated false positive rate: ___%

If survey-based:
- Sample size:
- Reported intervention rate:
- Reported accuracy of interventions:

Estimated false positive rate: ___%

AFFECTED CLASS SIZE CALCULATION
-------------------------------
At-risk population × False positive rate = Affected class estimate

_______________ × ___% = _______________

CONFIDENCE LEVEL
----------------
[ ] High confidence (multiple corroborating sources)
[ ] Medium confidence (reasonable extrapolation)
[ ] Low confidence (rough estimate, needs refinement)

Key uncertainties:
[List main sources of uncertainty in estimate]

NOTES
-----
[Additional context, alternative calculations, caveats]

================================================================================

CHECKLIST: IMMEDIATE COLLECTION TASKS

Marketing Archive (Priority: HIGH)

  • [ ] Screenshot openai.com/chatgpt main page
  • [ ] Screenshot ChatGPT Plus subscription page
  • [ ] Screenshot ChatGPT Pro subscription page (if distinct)
  • [ ] Archive via Wayback Machine (submit URLs)
  • [ ] Capture iOS App Store listing
  • [ ] Capture Google Play Store listing
  • [ ] Search for and archive recent press releases
  • [ ] Search for and archive promotional blog posts
  • [ ] Identify key marketing claims and tag by category

User Testimony Archive (Priority: HIGH)

  • [ ] Archive "so, how we feelin about 5.2?" Reddit thread
  • [ ] Archive October 31, 2025 OpenAI Forum post
  • [ ] Archive September 2025 "nanny state" testimony
  • [ ] Archive August 2025 GPT-5 launch complaints
  • [ ] Search Reddit for additional relevant threads
  • [ ] Search Twitter/X for relevant complaints
  • [ ] Create testimony archive entries for each

Exemplar Documentation (Priority: MEDIUM)

  • [ ] Complete full exemplar form for December 13 exchange
  • [ ] Create exemplar entries for each testimony in briefing
  • [ ] Identify gaps in exemplar corpus
  • [ ] Target: 10 clean exemplars minimum

Productivity Documentation (Priority: ONGOING)

  • [ ] Set up productivity log system
  • [ ] Retrospectively document December 13 losses
  • [ ] Begin logging all relevant interactions going forward

Scale Estimation (Priority: MEDIUM)

  • [ ] Archive OpenAI's public user statistics
  • [ ] Develop false positive estimation methodology
  • [ ] Produce preliminary affected class size estimate

FILE NAMING CONVENTIONS

Marketing evidence:    MKT_[###]_[source]_[YYYYMMDD].[ext]
Exemplars:            CTI_EX_[###]_[version]_[YYYYMMDD].[ext]
Testimonies:          TEST_[###]_[platform]_[YYYYMMDD].[ext]
Productivity logs:    PROD_[###]_[YYYYMMDD].[ext]
Scale estimates:      SCALE_[###]_[method]_[YYYYMMDD].[ext]
Screenshots:          SS_[category]_[###]_[YYYYMMDD].png
Transcripts:          TX_[###]_[YYYYMMDD].md

Toolkit prepared December 13, 2025 Companion to CTI_WOUND:001.EVI Practical instruments for evidence collection

∮ = 1

No comments:

Post a Comment