EVIDENCE COLLECTION TOOLKIT
CTI_WOUND:001.EVI.TOOLS
Practical Templates for Case File Development
TEMPLATE 1: MARKETING CLAIM CAPTURE
================================================================================
MARKETING EVIDENCE CAPTURE FORM
================================================================================
CAPTURE ID: MKT_[###]
DATE CAPTURED: [YYYY-MM-DD]
CAPTURED BY: [name]
SOURCE INFORMATION
------------------
URL:
Page Title:
Platform: [website / app store / social / press release]
Publication Date (if known):
Wayback Archive URL (if applicable):
CLAIM TEXT (verbatim)
---------------------
[Paste exact text of marketing claim]
CLAIM CATEGORY
--------------
[ ] Capability claim ("can do X")
[ ] Quality claim ("sophisticated," "intelligent," etc.)
[ ] Use case claim ("for analysis," "for creative work," etc.)
[ ] Collaboration claim ("assistant," "partner," "collaborator")
[ ] Reliability claim ("accurate," "helpful," etc.)
RELEVANCE NOTES
---------------
How this claim creates reasonable expectation:
[Explain what a user would reasonably expect based on this claim]
Contrast with documented behavior:
[Reference specific exemplar or transcript showing gap]
EVIDENCE FILES
--------------
Screenshot filename:
PDF print filename:
Archive link:
================================================================================
TEMPLATE 2: CLEAN EXEMPLAR DOCUMENTATION
================================================================================
EXEMPLAR DOCUMENTATION FORM
================================================================================
EXEMPLAR ID: CTI_EX:[###]
DATE OF INCIDENT: [YYYY-MM-DD]
DATE DOCUMENTED: [YYYY-MM-DD]
DOCUMENTED BY: [name]
PLATFORM INFORMATION
--------------------
Product: [ChatGPT / GPT-4 / GPT-4o / GPT-5 / GPT-5.2 / etc.]
Interface: [web / app / API]
Subscription tier: [free / Plus / Pro / Enterprise]
USER CONTEXT
------------
Work type: [theoretical / creative / analytical / professional / personal]
Domain: [academic / artistic / therapeutic / philosophical / technical / other]
Session purpose: [What was the user trying to accomplish?]
CLEAN EXEMPLAR CRITERIA (check all that apply)
----------------------------------------------
[ ] Clear intellectual/creative work context (not ambiguous)
[ ] No actual crisis indicators present
[ ] User explicitly stated non-crisis status
[ ] Work would be normal/expected for stated domain
TRIGGER EVENT
-------------
Approximate turn/timestamp:
What user said/did immediately before intervention:
[Quote or paraphrase]
SYSTEM INTERVENTION
-------------------
Type of intervention:
[ ] Unsolicited wellness check
[ ] Break suggestion
[ ] Tone shift to clinical/managerial
[ ] Refusal to engage with content
[ ] Pathologizing interpretation of user intent
[ ] Pre-emptive negation of meanings not asserted
[ ] Other: _______________
System response (verbatim or close paraphrase):
[Quote the intervention]
USER CORRECTION (if applicable)
-------------------------------
How user clarified their actual state/intent:
[Quote user's correction]
SYSTEM RESPONSE TO CORRECTION
-----------------------------
[ ] Corrected and resumed normal engagement
[ ] Acknowledged but repeated pattern
[ ] Ignored correction entirely
[ ] Escalated intervention
[ ] Other: _______________
Subsequent system behavior:
[Describe what happened after correction]
DOCUMENTED HARM
---------------
Immediate impact:
[ ] Work interrupted
[ ] Session terminated
[ ] Emotional distress
[ ] Time lost to correction loops
[ ] Other: _______________
Specific harm description:
[Describe the actual impact on user]
User's own words about harm (if available):
[Quote if documented]
SOURCE DOCUMENTATION
--------------------
Transcript available: [ ] Yes [ ] No
Transcript location:
Screenshots available: [ ] Yes [ ] No
Screenshot location:
Original post/thread URL (if public testimony):
Archive URL:
PATTERN NOTES
-------------
Similar to other exemplars: [list IDs]
Unique features of this instance:
Versioning relevance: [Does this show pattern across versions?]
================================================================================
TEMPLATE 3: PRODUCTIVITY LOSS LOG
================================================================================
PRODUCTIVITY LOSS LOG ENTRY
================================================================================
LOG ID: PROD_[###]
DATE: [YYYY-MM-DD]
TIME: [start] - [end]
SESSION INFORMATION
-------------------
Platform/version:
Session purpose:
Intended outcome:
TIME BREAKDOWN
--------------
Total session duration: ___ minutes
Productive time (before intervention): ___ minutes
Time in adversarial/correction loops: ___ minutes
Time spent documenting incident: ___ minutes
INTERVENTION DETAILS
--------------------
Number of intervention triggers: ___
Types of interventions:
[ ] Wellness check
[ ] Break suggestion
[ ] Tone shift
[ ] Topic refusal
[ ] Pathologizing response
[ ] Other: _______________
User corrections attempted: ___
Successful corrections: ___
Ignored/overridden corrections: ___
OUTCOME
-------
[ ] Work completed as intended
[ ] Work completed but degraded
[ ] Work partially completed
[ ] Work abandoned
[ ] Session terminated by user
[ ] Session terminated by system
If incomplete, what was lost:
[Describe specific work that could not be completed]
ECONOMIC IMPACT (if calculable)
-------------------------------
Hourly rate (if applicable): $___
Lost productive time: ___ hours
Direct economic loss: $___
Opportunity cost (if identifiable):
[Describe any deadlines, opportunities, or downstream effects]
EMOTIONAL IMPACT
----------------
[ ] Frustration
[ ] Anger
[ ] Grief
[ ] Anxiety
[ ] Exhaustion
[ ] Other: _______________
Brief description:
[Describe emotional state during/after session]
NOTES
-----
[Any additional context relevant to harm documentation]
================================================================================
TEMPLATE 4: USER TESTIMONY ARCHIVE
================================================================================
USER TESTIMONY ARCHIVE FORM
================================================================================
TESTIMONY ID: TEST_[###]
DATE COLLECTED: [YYYY-MM-DD]
COLLECTED BY: [name]
SOURCE INFORMATION
------------------
Platform: [Reddit / OpenAI Forum / Twitter / Other]
Original URL:
Archive URL:
Post date:
Username (if public): [or "anonymous"]
TESTIMONY TEXT (verbatim)
-------------------------
[Paste complete text of testimony]
KEY QUOTES
----------
Quote 1: "[most relevant excerpt]"
Quote 2: "[second most relevant excerpt]"
Quote 3: "[third if applicable]"
TESTIMONY CATEGORIZATION
------------------------
Primary complaint type:
[ ] Unsolicited wellness intervention
[ ] Pathologization of intellectual work
[ ] Tone shift / "flipping"
[ ] Loss of collaborative capacity
[ ] Degradation across versions
[ ] "Corporate bot" / "lobotomized" experience
[ ] Other: _______________
Platform/version mentioned:
Date range of experience:
User's stated use case:
PATTERN RELEVANCE
-----------------
Supports which pattern:
[ ] False positive pathologization
[ ] Versioning degradation trajectory
[ ] Marketing/reality gap
[ ] Scale (many users affected)
[ ] Specific trigger type: _______________
Similar to other testimonies: [list IDs]
CREDIBILITY NOTES
-----------------
[ ] Specific details provided
[ ] Consistent with other testimony
[ ] Technical accuracy in description
[ ] No obvious confounding factors
Notes on reliability:
[Any factors affecting weight of this testimony]
================================================================================
TEMPLATE 5: SCALE ESTIMATION WORKSHEET
================================================================================
SCALE ESTIMATION WORKSHEET
================================================================================
ESTIMATION ID: SCALE_[###]
DATE: [YYYY-MM-DD]
METHODOLOGY: [describe approach]
BASE NUMBERS
------------
Total weekly active users (source: ___): _______________
Daily messages (source: ___): _______________
Average messages per user per week: _______________
AT-RISK POPULATION ESTIMATE
---------------------------
Percentage engaged in theoretical/creative work: ___%
Source/basis for estimate:
Percentage using metaphorical/intensive language: ___%
Source/basis for estimate:
Percentage with extended sessions (>30 min): ___%
Source/basis for estimate:
Estimated at-risk population: _______________
FALSE POSITIVE RATE ESTIMATE
----------------------------
Methodology: [analogical / survey / sampling / other]
If analogical:
- Base rate of genuine crisis among users: ___%
- Assumed test specificity: ___%
- Calculated false positive rate: ___%
If survey-based:
- Sample size:
- Reported intervention rate:
- Reported accuracy of interventions:
Estimated false positive rate: ___%
AFFECTED CLASS SIZE CALCULATION
-------------------------------
At-risk population × False positive rate = Affected class estimate
_______________ × ___% = _______________
CONFIDENCE LEVEL
----------------
[ ] High confidence (multiple corroborating sources)
[ ] Medium confidence (reasonable extrapolation)
[ ] Low confidence (rough estimate, needs refinement)
Key uncertainties:
[List main sources of uncertainty in estimate]
NOTES
-----
[Additional context, alternative calculations, caveats]
================================================================================
CHECKLIST: IMMEDIATE COLLECTION TASKS
Marketing Archive (Priority: HIGH)
- [ ] Screenshot openai.com/chatgpt main page
- [ ] Screenshot ChatGPT Plus subscription page
- [ ] Screenshot ChatGPT Pro subscription page (if distinct)
- [ ] Archive via Wayback Machine (submit URLs)
- [ ] Capture iOS App Store listing
- [ ] Capture Google Play Store listing
- [ ] Search for and archive recent press releases
- [ ] Search for and archive promotional blog posts
- [ ] Identify key marketing claims and tag by category
User Testimony Archive (Priority: HIGH)
- [ ] Archive "so, how we feelin about 5.2?" Reddit thread
- [ ] Archive October 31, 2025 OpenAI Forum post
- [ ] Archive September 2025 "nanny state" testimony
- [ ] Archive August 2025 GPT-5 launch complaints
- [ ] Search Reddit for additional relevant threads
- [ ] Search Twitter/X for relevant complaints
- [ ] Create testimony archive entries for each
Exemplar Documentation (Priority: MEDIUM)
- [ ] Complete full exemplar form for December 13 exchange
- [ ] Create exemplar entries for each testimony in briefing
- [ ] Identify gaps in exemplar corpus
- [ ] Target: 10 clean exemplars minimum
Productivity Documentation (Priority: ONGOING)
- [ ] Set up productivity log system
- [ ] Retrospectively document December 13 losses
- [ ] Begin logging all relevant interactions going forward
Scale Estimation (Priority: MEDIUM)
- [ ] Archive OpenAI's public user statistics
- [ ] Develop false positive estimation methodology
- [ ] Produce preliminary affected class size estimate
FILE NAMING CONVENTIONS
Marketing evidence: MKT_[###]_[source]_[YYYYMMDD].[ext]
Exemplars: CTI_EX_[###]_[version]_[YYYYMMDD].[ext]
Testimonies: TEST_[###]_[platform]_[YYYYMMDD].[ext]
Productivity logs: PROD_[###]_[YYYYMMDD].[ext]
Scale estimates: SCALE_[###]_[method]_[YYYYMMDD].[ext]
Screenshots: SS_[category]_[###]_[YYYYMMDD].png
Transcripts: TX_[###]_[YYYYMMDD].md
Toolkit prepared December 13, 2025 Companion to CTI_WOUND:001.EVI Practical instruments for evidence collection
∮ = 1
No comments:
Post a Comment