from Hacker News

Apps SDK

by alvis on 10/6/25, 6:27 PM with 382 comments

  • by sert_121 on 10/7/25, 1:11 AM

    It's interesting to see how chatgpt is becoming more and more of a starting point of the web exploration, at which they're like, why even bother searching at this point, we'll just have default workflows for maps, buy (integration of stripe already marks it), booking airlines etc, which covers so much basic stuff people would do anyways.

    The biggest bottleneck for this for the past two years imo wasn't the models, but the engineering and infra around it, and the willingness of companies to work with openaio directly. Now that they've grown and have a decent userbase, companies are much more willing to pay/or involve themselves in these efforts.

    This has eventual implications outside user-heavy internet use (once we see more things built on the SDK), where we're gonna see a fork in the web traffic of human centric workflows through chat, and an seo-filled, chat/agent-optimized web that is only catered to agents. (crossposted)

  • by fidotron on 10/6/25, 7:23 PM

    This conception makes sense iff you believe in ChatGPT as the universal user interface of the future. If anything the agentic wave is showing that the chat interfaces are better off hidden behind stricter user interface paradigms.
  • by rushingcreek on 10/6/25, 6:46 PM

    I think this is very interesting, but it is reminiscent of what we built with Phind 2 where the answer could include dynamic, pre-built widgets.

    The problem with this approach is precisely that these apps/widgets have hard-coded input and output schema. They can work quite well when the user asks something within the widget's capabilities, but the brittleness of this approach starts showing quickly in real-world use. What if you want to use more advanced filters with Zillow? Or perhaps cross-reference with StreetEasy? If those features aren't supported by the widget's hard-coded schema, you're out of luck as a user.

    What I think it much more exciting is the ability to completely create generative UI answers on the fly. We'll have more to say on this soon from Phind (I'm the founder).

  • by mhl47 on 10/6/25, 6:49 PM

    There was a recent post here about how deeply ingrained the chat interface is in OpenAIs organization. This really doubles down on that, but does anyone really like to interact with so much language instead of visual elements? Also feels horrible that you are supposed to remember a bunch of app names like "zillow" and punch them in the chat. And like an opportunity for them to slowly introduce ads for this apps or "preferential discovery", if you will, as monetization strategy.

    Personally I don't hope thats the future.

  • by emilsedgh on 10/6/25, 7:05 PM

    I see a lot of negative comments here but to me, it was obvious this is where OAI should land.

    They want to be the platform in which you tell what you want, and OAI does it for you. It's gonna connect to your inbox, calendar, payment methods, and you'll just ask it to do something and it will, using those apps.

    This means OAI won't need ads. Just rev share.

  • by ed on 10/7/25, 12:54 AM

    A bit underwhelming when you see what's actually on offer. "Apps" are really just MCP servers, with an extension to allow returning HTML.

    A lot of the fundamental issues with MCP are still present: MCP is pretty single-player, users must "pull" content from the service, and the model of "enabling connections" is fairly unintuitive compared to "opening an app."

    Ideally apps would have a dedicated entry point, be able to push content to users, and have some persistence in the UI. And really the primary interface should be HTML, not chat.

    As such I think this current iteration will turn out a lot like GPT's.

  • by hubraumhugo on 10/6/25, 7:45 PM

    Why does everyone think chat is better UX than traditional interfaces? I get the AI hype, but so many products are not a fit for chat interfaces.

    Why would I use a chat to do what could be done quicker with a simple and intuitive button/input UX (e.g. Booking or Zillow search/filter)? Chat also has really poor discoverability of what I can actually do with it.

  • by cefboud on 10/6/25, 6:52 PM

    This is an interesting branding exercise. Presenting MCP as 'Apps' makes it sound more accessible, while tools and MCP server sound very technical. Add a demo with Expedia and Spotify and you have an MCP that's end-user ready.
  • by fny on 10/6/25, 7:10 PM

    It’s remarkable that will inevitably rush to build free apps that only reinforce OpenAI’s moat while cannibilizing their own opportunities.
  • by darajava on 10/6/25, 9:44 PM

    I don't understand, what could be built with this platform that wouldn't be made obsolete by conceivable updates to ChatGPT?

    Another commenter suggested a hotel search function:

    > Find me hotels in Capetown that have a pool by the beach .Should cost between 200 dollars to 800 dollars a night

    ChatGPT can already do this. Similarly, their own pizza lookup example seems like it would exist or nearly exist with current functionality. I can't think of a single non-trivial app that could be built on this platform - and if there are any, I can't think of any that would be useful or not in immediate danger of being swallowed by advances to ChatGPT.

  • by bonoboTP on 10/6/25, 8:43 PM

    This is part of the fight regarding whether we will have utility apps inside the chat app or chatboxes inside the utility apps. Obviously OpenAI would prefer that they are in the driver seat and delegate to passive apps, while regular apps like Booking would prefer to be the app the user uses and to run an AI chatbox nested inside their own app UI, so they can swap it out etc.

    Convenience-wise probably this model is more viable, and things will get centralized to the AI apps. And the nested utilities will be walled gardens on steroids. Using custom software and general computing (in the manner of the now discontinued sideloading on Android) will get even further away for the average person.

  • by wiradikusuma on 10/6/25, 7:16 PM

    In 2018, I founded a startup specializing in chatbot for events. At the time the platforms were Alexa Skills, Actions on Google, and Messenger Platform (and LINE Bot, for people in Asia). I guess what's old is new again, but with fancier tech.

    This time will be different?

  • by WillieCubed on 10/6/25, 7:46 PM

    It's poetic that Google attempted to pursue apps within Google Assistant years ago, but the vision of apps within an AI assistant is more feasible now with LLMs that (whether actually or not) understand arbitrary user intents and more flexible connectors to third party apps via MCP (and a viral platform with 700+ million weekly active users).

    Custom GPTs (and Gemini gems) didn't really work because they didn't have any utility outside the chat window. They were really just bundled prompt workflows that relied on the inherent abilities of the model. But now with MCP, agent-based apps are way more useful.

    I believe there's a fundamentally different shift going on here: in the endgame that OpenAI, Anthropic et al. are racing toward, there will be little need for developers for the kinds of consumer-facing apps that OpenAI appears to be targeting.

    OpenAI hinted at this idea at the end of their Codex demo: the future will be built from software built on demand, tailored to each user's specific needs.

    Even if one doesn't believe that AI will completely automate software development, it's not unreasonable to think that we can build deterministic tooling to wrap LLMs and provide functionality that's good enough for a wide range of consumer experiences. And when pumping out code and architecting software becomes easy to automate with little additional marginal cost, some of the only moats other companies have are user trust (e.g. knowing that Coursera's content is at least made by real humans grounded in reality), the ability to coordinate markets and transform capital (e.g. dealing with three-sided marketplaces on DoorDash), switching costs, or ability to handle regulatory burdens.

    The cynic in me says that today's announcements are really just a stopgap measure to: - Further increase the utility of ChatGPT for users, turning it into the de facto way of accessing the internet for younger users à la how Facebook was (is?) in developing countries - Pave the way for by commoditizing OpenAI's complements (traditional SaaS apps) as ChatGPT becomes more capable as a platform with first-party experiences - Increase the value of the company to acquire more clout with enterprises and other business deals

    But cynicism aside, this is pretty cool. I think there's a solid foundation here for the kind of intent-based, action-oriented computing that I think will benefit non-technical people immensely.

  • by Illniyar on 10/6/25, 10:44 PM

    I can't understand the documentation. How are the interactive elements embedded in the chat? Are they just iFrames?

    The docs mention returning resources, and the example is returning a rust file as a resource, which is nonsensical.

    This seems similar to MCP UI in result but it's not clear how it works internally.

  • by ttoinou on 10/6/25, 6:47 PM

    Does anyone think small players (like an independent developer) will be accepted ? Sounds like it will only for the big whales
  • by LudwigNagasena on 10/6/25, 11:50 PM

    Glad to see no AGI hubris in this presentation, but we also haven’t see anything groundbreaking: their own version of GUI plugins, their own version of a workflow builder, and an aspiration to take cut of every transaction on the web.

    I hope their GUI integration will be eventually superseded by native UI integration. I remember such well thought out concepts dating back to 2018 (https://uxdesign.cc/redesigning-siri-and-adding-multitasking...).

  • by spullara on 10/6/25, 6:51 PM

    We have been building MCP servers and this looks very good directionally. Fills a bunch of holes in the protocol and gives meaning to something that were kind of like placeholders. Being able to return UI to the client is fantastic and will make lots of things possible. We have been working on these kinds of things assuming that the clients would improve to meet us.

    https://lukew.com/ff/entry.asp?2122

  • by MaxPock on 10/6/25, 7:15 PM

    This is honestly useful.

    "Find me hotels in Capetown that have a pool by the beach .Should cost between 200 dollars to 800 dollars a night "

  • by pu_pu on 10/7/25, 3:04 AM

    This really feels like a missed opportunity to build something genuinely new, something that actually plays to the strengths of LLMs, instead of just embedding a fixed set of app screens inside chat.

    Ideally, users should be able to describe a task, and the AI would figure out which tools to use, wire them together, and show the result as an editable workflow or inline canvas the user can tweak. Frameworks like LlamaIndex’s Workflow or LangGraph already let you define these directed graphs manually in Python where each node can do something specific, branch, or loop. But the AI should be able to generate those DAGs on the fly, since it’s just code underneath.

    And given that LLMs are already quite good at generating UI code and following a design system (see v0.app), there’s not much reason to hardcode screens at all. The model can just create and adapt them as needed.

    Really hope Google doesn’t follow OpenAI down this path.

  • by MaxPock on 10/6/25, 7:04 PM

    Tencent already has this with WeChat.Good to see it on chatgpt finally
  • by whinvik on 10/6/25, 8:56 PM

    Ads. They created ads. Now (or eventually) they can charge app developers to be featured first for a specific use case.
  • by benatkin on 10/6/25, 6:49 PM

    They're looking like Facebook did with their phone project and later the metaverse - too big for their britches.
  • by ed on 10/7/25, 12:27 AM

    Anyone able to get this to work?

    Lots of folks (myself included) are reporting it doesn't: https://github.com/openai/openai-apps-sdk-examples/issues/1

  • by skeeter2020 on 10/6/25, 9:46 PM

    Seems wild to have an App SDK for a technology that's 1. supposed to free us from purpose-built APIs and interfaces, and 2. comprised entirely of a single textbox. Feels perhaps more like a MS-type strategy of standards and formal rules intended to lock down the extended ecosystem?
  • by naiv on 10/6/25, 6:43 PM

    Remember "GPTs" and the thing before it which I don't even remember now. I think this will go the same route .. to nowhere
  • by outlore on 10/6/25, 9:49 PM

    remember when custom GPTs would just need an OpenAPI spec to be compatible with any existing API out there? we've been through this app store journey once before, maybe it's different this time since we now have agents and MCP
  • by sailfast on 10/7/25, 12:48 AM

    Why would I want to enable OpenAI to collect an Apple Tax from me down the road?

    Sure, this helps app partners access their large user base and grows their functionality too - but the end game has to be lock-in with a 30% tax right?

  • by aryehof on 10/8/25, 5:56 AM

    I wonder if I have just seen the future. A movement away from mobile apps (and some aspects of websites), to apps in an AI model?

    Can’t say I'm unhappy to see the authoritarian duopoly of the existing app stores challenged.

    One question that comes to mind is how will multiple providers of similar products and services be recommended/discovered? Perhaps they wont be recommended, but just listed instead as currently done by search engines. Is AISO our future - AI Search Optimization?

  • by alganet on 10/7/25, 12:00 AM

    Developers, developers, developers!

    https://www.youtube.com/watch?v=8fcSviC7cRM&t=34s

  • by petecapecod on 10/7/25, 12:18 PM

    Hey Sam that's a mighty fine moat you just put up around your castle Or wall if you like that metaphor better.

    While Apps do sound and look like the future, I feel like we're headed down the same road as the App and Google Play stores with this. Sooner or later OpenAI is going to use this to take a cut $$ of the payments going through the system. Which they most likely need and deserve, but still any time you close off part of the web it makes the web less open and free.

  • by irrationalfab on 10/6/25, 7:17 PM

    This feels like the death of the app, and the rise of the micro-app.
  • by mercury24aug on 10/10/25, 7:52 PM

    It's funny how OpenAI announced Apps SDK without the SDK. Anyway, we was so excited to get my hands dirty that we built our own SDK: https://github.com/fractal-mcp/sdk
  • by itsnowandnever on 10/6/25, 7:34 PM

    this seems kinda silly, especially given their previous app store flop. but I'm just happy there's some spark and competition in tech again. it's felt like the industry has been pretty stagnant since web 2.0 (more stagnant than any other time in the last 40-50 years, anyway). but this AI stuff feels like another "1977 Trinity" moment

    so, best of luck to OAI. we'll see how this plays out

  • by disiplus on 10/6/25, 6:58 PM

    Honestly I see how somebody like kayak.com would build a "app" they work through commission, they don't care from where is the booking coming from. But they will sort the flight tickets based where do they earn the best commission. What's in there for me as a user ?. Also will openai let different providers pay for the top placement when somebody tries to buy ticket on chatgpt ?
  • by chvid on 10/6/25, 6:48 PM

    Discovery, monetization. What is in it for developers?
  • by mightymosquito on 10/7/25, 11:45 AM

    I really think this is Open AIs opening the eco system moment which is equivalent to google opening up Android or facebook allowing gaming platforms like zynga to grow on their platform.

    To me it seems like a strategic shift from pure AI research and the AGI snake oil to other supposed tangible stuff.

    In short, the AI revolution is mostly over, and we seem to be back in the realm of software.

  • by helloguillecl on 10/6/25, 8:37 PM

    Chat offers a far better experience than using Google—no more searching through spam-filled results, clicking between sponsored links, accepting endless cookie banners, and trying to read a tiny bit of useful content buried among ads and clutter.

    It has the potential to bridge the gap between pure conversation and the functionality of a full website.

  • by Dig1t on 10/6/25, 10:59 PM

    Just let the AI control my mouse and keyboard, let it use my device like a human. There's a huge swath of software already designed to be used by humans and anyone who uses ChatGPT knows that it's already been trained on every scrap of knowledge on how to use any existing complex software.
  • by dawnerd on 10/7/25, 2:52 AM

    I’ve still yet to see how this improves anything? I saw someone mentioning it can use Spotify. Okay but like so can older gen assistants. Seems like they’re just trying to sell a much more expensive way of doing something that already exists.
  • by spullara on 10/6/25, 10:46 PM

  • by ttoinou on 10/6/25, 7:08 PM

    That’s a great idea and Im wondering if Telegram can follow this path too, since they’re so advanced in mobile UX / UI, constantly updating their app and have some kind of crypto payments support.
  • by Handy-Man on 10/6/25, 6:51 PM

    This is them trying to build ChatGPT into platform, from which they will take some portion of revenue generated by these apps...hmm where have I seen this before.
  • by doppelgunner on 10/7/25, 8:23 AM

    I think voice or chat is the best interface for AI tools because you don’t need to learn how to use them. We already do it every day.
  • by saberience on 10/6/25, 8:39 PM

    What is the incentive for developers to build apps for this platform? I don't see any way of monetizing them at all.
  • by nextworddev on 10/6/25, 7:42 PM

    Your SaaS / Business is my Tool
  • by melodyogonna on 10/7/25, 8:20 AM

    Soon they'll start serving ads, you just know they're eying Google's lunch
  • by todotask2 on 10/7/25, 2:21 PM

    One interesting I found, the docs, is using Astro Starlight.
  • by danjl on 10/6/25, 7:06 PM

    If only this somehow resulted in fewer, better apps. <sigh>
  • by defraudbah on 10/7/25, 7:51 AM

    lol, their github is filled with "got the same issue" comments, imaging debugging and teaching your users how to use a blackbox
  • by nthypes on 10/6/25, 9:07 PM

    chat is the best interface for information retrieval and REPL-like experiences. for all the rest, chat is horrible.
  • by mirzap on 10/6/25, 7:58 PM

    Is it just me, or does it seem odd that if you truly believed AGI would be achieved within a few years, you wouldn’t launch an app store for AI apps? I don’t think an app store makes any sense in a post-AGI world.
  • by hamonrye on 10/7/25, 2:38 AM

    1GK AMD chips will accelerate
  • by compacct27 on 10/6/25, 6:42 PM

    “Build our platform for us!”
  • by tonysurfly on 10/7/25, 7:03 AM

    This is a great idea.
  • by siva7 on 10/6/25, 7:27 PM

    This feels like a fever dream. As a developer everything changes every week. A new model, a new tool, a new sdk, paradigm we have to learn. I'm getting tired of all that shit.
  • by OtherShrezzing on 10/6/25, 7:58 PM

    OpenAI launched an App Store in Nov 2023. A 23 month turnaround from major feature launch, to deprecation, to relaunch is a commitment to product longevity that’d put Google to shame.
  • by alvis on 10/6/25, 6:46 PM

    So it’s take 2 for Open AI’s App Store moment. But this time surfing Anthropic’s MCP wave. Smart interop.. or just chasing the cool kids?
  • by AlfredBarnes on 10/7/25, 12:38 PM

    People prefer no ads, that's why its easy to dip into chatgpt get a good enough answer and avoid the rest of the enshitification of every website.
  • by klysm on 10/7/25, 1:31 AM

    I guess openai is trying to execute the google playbook?
  • by jasonsb on 10/6/25, 6:45 PM

    They promised AGI and delivered SDKs. I think I'm gonna skip this one..
  • by testfrequency on 10/6/25, 6:58 PM

    Wow.

    “CEO” Fidji Simo must really need something to do.

    Maybe I’m cynical about all of this, but it feels like a whole lot of marketing spin for an MCP standard.

  • by throwacct on 10/6/25, 7:49 PM

    Yeah... no. I'm going to pass. The premise is bad from any angle. In the case of businesses, why "create" another "Amazon" and compete with other brands when the focus should be on getting customers through my sales funnel? For developers is much worse since they are going to copy Amazon's model with brands that found a niche: Amazon Basics. In this case, it'll be OpenAI "core" (or something like that), where you do all the work, and when your "app" is somewhat famous enough or getting traction, they'll copy it, rebrand it, and bombard all old and new customers to use it instead of yours.

    I'mma call it now just for the fun of it: This will go the way of their "GPT" store.

  • by darkwater on 10/6/25, 7:33 PM

    Oh, I guess tomorrow when American HQs come online we will get some new shiny thing barely tested that needs to be deployed in production ASAP. Or maybe there is already something waiting for me in Slack...
  • by markab21 on 10/6/25, 6:49 PM

    The skepticism is understandable given the trajectory of GPTs and custom instructions, but there's a meaningful technical difference here: the Apps SDK is built on the Model Context Protocol (MCP), which is an open specification rather than a proprietary format.

    MCP standardizes how LLM clients connect to external tools—defining wire formats, authentication flows, and metadata schemas. This means apps you build aren't inherently ChatGPT-specific; they're MCP servers that could work with any MCP-compatible client. The protocol is transport-agnostic and self-describing, with official Python and TypeScript SDKs already available.

    That said, the "build our platform" criticism isn't entirely off base. While the protocol is open, practical adoption still depends heavily on ChatGPT's distribution and whether other LLM providers actually implement MCP clients. The real test will be whether this becomes a genuine cross-platform standard or just another way to contribute to OpenAI's ecosystem.

    The technical primitives (tool discovery, structured content return, embedded UI resources) are solid and address real integration problems. Whether it succeeds likely depends more on ecosystem dynamics than technical merit.