Jump to content
Wikipedia The Free Encyclopedia

GPT-5.5

From Wikipedia, the free encyclopedia
2026 large language model by OpenAI
GPT-5.5
Developer OpenAI
Initial releaseApril 23, 2026; 47 days ago (2026年04月23日)
PredecessorGPT-5.4
License Proprietary
Websiteopenai.com/index/introducing-gpt-5-5/
Part of a series on
OpenAI
Products
Models
People
Concepts

GPT-5.5 (Generative Pre-trained Transformer 5.5) is a large language model (LLM) released by OpenAI on April 23, 2026.[1] [2] The model is also known by its codename "Spud".[3]

OpenAI reported GPT-5.5 benchmark scores including 82.7% on Terminal-Bench 2.0, 51.7% on FrontierMath Tier 1–3, and 35.4% on FrontierMath Tier 4, while listing lower scores for Claude Opus 4.7 and Gemini 3.1 Pro on the same benchmarks.[4] [5]

OpenAI said a recurring tendency in its models to mention goblins, gremlins, and other creatures began with GPT-5.1 and became noticeable in GPT-5.5's Codex testing.[6] The company attributed the behavior to rewards used when training the "Nerdy" personality, which favored creature-word outputs and transferred beyond that personality during later training. OpenAI said it retired the Nerdy personality, removed the goblin-affine reward signal, filtered training data containing creature words, and added a developer-prompt instruction for GPT-5.5 in Codex.[7]

Release

[edit ]

GPT-5.5 Thinking and GPT-5.5 Pro were released on April 23, 2026, with neither being available to free-tier users. OpenAI withheld API access at launch, saying that API deployments required "different safeguards"; GPT-5.5 and GPT-5.5 Pro became available in the API on April 24, 2026.[5] [8] [9] GPT-5.5 Instant was released to free-tier users on May 5, 2026, replacing GPT-5.3 Instant as ChatGPT's default model for all users.[10] On May 7, 2026, OpenAI announced GPT-5.5-Cyber, a limited-preview variant for vetted cybersecurity teams under its Trusted Access for Cyber program.[11]

System prompt

[edit ]

In April 2026, a GPT-5.5 developer-prompt instruction in the GitHub repository of Codex told it not to mention goblins if the user prompt did not require it.[12]

Reception

[edit ]

In a series of tests conducted by Tom's Guide in which the website compared GPT-5.5 against Anthropic's Claude Opus 4.7, GPT-5.5 lost in all 7 categories.[13] The website praised GPT-5.5 for its speed but criticized the model for its tendency to hallucinate rather than admitting that it does not know something.[13]

ZDNET praised GPT-5.5 for its polished answers and "[s]trong performance across writing, coding, and reasoning tasks".[14] Compared to its predecessor, the website described GPT-5.5 as better, faster, and showing "improvements in agentic coding, conceptual clarity, scientific research ability, and accuracy during knowledge work".[14]

The AI Security Institute reported that GPT-5.5 had a 71.4% (±8.0%, one standard error) average pass rate on its expert-level cyber tasks, compared with 68.6% (±8.7%) for Anthropic's Claude Mythos Preview, and said GPT-5.5 "may be the strongest model we have tested" on that measure.[15]

References

[edit ]
  1. ^ Capoot, Ashley (April 23, 2026). "OpenAI announces GPT-5.5, its latest artificial intelligence model". CNBC. Retrieved April 23, 2026.
  2. ^ Peters, Jay (April 23, 2026). "OpenAI says its new GPT-5.5 model is more efficient and better at coding". The Verge. Retrieved April 23, 2026.
  3. ^ Mills, Madison; Fried, Ina (April 23, 2026). "OpenAI releases "Spud" GPT-5.5 model". Axios. Retrieved April 23, 2026.
  4. ^ Franzen, Carl (April 23, 2026). "OpenAI's GPT-5.5 is here, and it's no potato: narrowly beats Anthropic's Claude Mythos Preview on Terminal-Bench 2.0". VentureBeat. Retrieved May 15, 2026.
  5. ^ a b "Introducing GPT-5.5". OpenAI. April 23, 2026. Retrieved May 15, 2026.
  6. ^ McMahon, Liv (April 30, 2026). "OpenAI tells ChatGPT models to stop talking about goblins". BBC . Retrieved May 13, 2026.
  7. ^ "Where the goblins came from". OpenAI. April 29, 2026. Retrieved May 15, 2026.
  8. ^ "GPT-5.5 Model | OpenAI API". developers.openai.com. Retrieved May 15, 2026.
  9. ^ "GPT-5.5 pro Model | OpenAI API". developers.openai.com. Retrieved May 15, 2026.
  10. ^ "GPT-5.5 Instant: smarter, clearer, and more personalized". OpenAI. May 5, 2026. Retrieved May 15, 2026.
  11. ^ "Scaling Trusted Access for Cyber with GPT-5.5 and GPT-5.5-Cyber". OpenAI. May 7, 2026. Retrieved May 15, 2026.
  12. ^ Orland, Kyle (April 30, 2026). "OpenAI Codex system prompt includes explicit directive to "never talk about goblins"". Ars Technica. Retrieved April 30, 2026.
  13. ^ a b Caswell, Amanda (April 25, 2026). "7-0 wipeout: I put ChatGPT-5.5 vs Claude 4.7 through 7 impossible tests — and the results shocked me". Tom's Guide. Retrieved April 27, 2026.
  14. ^ a b "I put GPT-5.5 through a 10-round test: It scored 93/100, losing points only for exuberance". ZDNET. Retrieved April 27, 2026.
  15. ^ "Our evaluation of OpenAI's GPT-5.5 cyber capabilities | AISI Work". AI Security Institute. April 30, 2026. Retrieved May 15, 2026.
[edit ]
Products
ChatGPT
Foundation
models
GPT models
Specialized
Intelligent
agents
People
Senior
management
Current
Former
Board of
directors
Current
Former
JVs
Related
Concepts
Training, prompting, and alignment
Models
Early and encoder models
GPT series
Other model families
Chatbots and assistants
Agents, coding, and applications
Software
Hardware and infrastructure
Benchmarks, evaluation, and detection
Datasets and data
Organizations
People
Social, economic, and governance
Concepts
Models
Text
Image
Video
Speech
Music
Products
Coding tools
Agents
Applications
Companies
Controversies

AltStyle によって変換されたページ (->オリジナル) /