I love this approach, very Diamond Age. I uploaded a picture of my cat and learned a lot about coat genetics. I think it fixated on his coloration.
vova_hn2
The demo on the main page suggests "Paris Travel Overview / Visiting Notre Dame", so I tried a couple of cities and places that I've actually been to.
If often identifies some of the points of interest correctly, but their spacial location relative to each other is completely insane. Like, not even close to reality.
giobox
I just asked it to create a torque spec diagram of the suspension for my car, a subject I'm pretty familiar with. It amazingly drew everything correctly, displayed the correct torque figures and allowed me to click on individual components to zoom in further, providing more specs.
Genuinely one of the most impressive demos I've tried in a long time. I was able to use it almost like a living version of a classic illustrated Haynes workshop manual.
show comments
martianlantern
Cool project, but just a side thought I was having about how do people have resources and the money to make things like this and make it avl for public, I mean it's fair to say they have their own GPUs or if they are using api keys for gpt or Gemini with enterprise subsidized inference
But still coming from a frugal background I still cannot wrap my head around this
Ah I was thinking this created the webpage itself, which I always thought was an interesting concept. Some future where the application is crafted in realtime to fulfill the needs of the user. Has anyone made something like this?
Interesting idea, but just about everything is failing for me. Probably the HN hug of death happening.
Gemini generateContent request failed: { "error": { "code": 429, "message": "You exceeded your current quota, please check your plan and billing details. For more information on this error, head to: https://ai.google.dev/gemini-api/docs/rate-limits. To monitor your current usage, head to: https://ai.dev/rate-limit. ", "status": "RESOURCE_EXHAUSTED", "details": [ { "@type": "type.googleapis.com/google.rpc.Help", "links": [ { "description": "Learn more about Gemini API quotas", "url": "https://ai.google.dev/gemini-api/docs/rate-limits" } ] } ] } }
Congrats on the launch! This is an amazing product.
Something I would add is a panel with sources in case someone wants to have a deep dive in the information.
My 2 cents: This could be transformed to a state of the art teacher kind of product
tomaskafka
I applaud the idea and the technological part of that, but of course, as with all AI models, it produces utter slop once you go even a tiny bit outside of the learning data.
For example, here I asked for a mechanism of converting the circular motion to wing flapping, and it has no idea what to do at all.
The core problem with using AI to learn is that if you don't know about the specific area (which you don't, that's why you are learning it), it can (and absolutely will) fake knowledge without you ever noticing it, and be utterly wrong, disseminating false info, and teaching you (or your kids) wrong world models.
saberience
It's not really a website streamed live at all... It's just asking a model to generate an image with some web searches plus its built-in (trained knowledge)
You can do this with any agentic coding model now... just ask OpenAI codex : generate me an image given this query: "the query" and you will get the same kind of outputs.
squibonpig
Very cool as a demo. I tried something information-dense, a poker pre-flop chart for a specific stack depth (40BB BTN vs UTG rfi) and it was about what I expected. It doesn't even resemble a poker chart and there's no salvageable information as far as I can tell. Not really something this should be able to do though.
I typed in the address of my childhood home, and breathed a sigh of relief when it showed a random home with solar panels and 'clean modern sustainable living' which my childhood home was not. Even added solar panels.
General design was correct, and it included the name of a town just nearby.
Not a surprising result, but made me reflect on what a weird world we now live in.
gyomu
I love this because it articulates so well a precise vision of a world I don’t want to live in.
This is built from the collective works of all humans throughout history who have strived to make infographics, illustrations, and communicate knowledge - with 0 actual credit or reference to them (or financial compensation, if they’re still alive).
Instead, who is making money from this? Google, as providers of the model - and maybe the founders of this product, if they ever choose to monetize it somehow.
I’m not even going to get into how the results it produces have just enough “insight” to appear valid but the moment you inspect it up close, it’s completely wrong in most details, and replete with ornamentation that doesn’t actually add to meaning - a Potemkin village of knowledge - because the common answer to this criticism around here is “just wait 6 months bro the models will definitely solve all those problems”.
We are not better off investing billions of dollars in computers doing this over paying humans to write and illustrate and make cultural artefacts. We are not better off putting this in the hands of kids rather than meaningfully designed resources & curriculum designed by humans.
What are we even doing.
That’s the biggest problem with the current wave of AI tooling - it’s so easy to make a cool demo all while completely missing the point of what actually is good for human flourishing.
ianand
It's like "GPT is all you need for the backend" [1] on steroids
Great demo, interesting transitions and UI, but the model / generated information is definitely not correct.
Legend2440
Interesting idea and cool demo.
For this to really be practical you'd need a way to run networks many times faster and more efficiently than today's GPUs. This is too slow to work even with cloud GPUs powering it.
Maybe someday.
jdthedisciple
I like the visual style it produces, great for educational material. What is it called?
__MatrixMan__
This is fun. I started with "all hail the glow cloud" and now I'm clicking to wander around Nightvale. It's not exactly suprrising that it knows all of the lore, but it paints a pretty cohesive picture...
otterpro
It's pretty cool. I created a beautiful isometric illustration of home garden, which is worthy of being featured in a real book or magazine. I really like the isometric view to explain things, and the color palette is consistent and pleasant.
deviantony
Very cool project ! I fear this might have a pretty high hallucination potential (with current models) the deeper you dig into the base image/context and clicking on potentially unrelated elements in the image.
Nevertheless, love the idea.
[deleted]
sentientslug
This is not really working for me at all, the second images always look near identical to the first with some minor changes. Maybe my prompts are the issue? Anyone have some good prompts?
show comments
namanvyas
Couldn't get it to load (probably getting hammered right now) but the concept is interesting. Feels like one of those things where the tech needs to get 10x cheaper before it actually makes sense as a product.
docheinestages
Great idea and execution!
brohan90
This is one of the more unique ideas i've encountered in a long time
show comments
dnnddidiej
Fun. Uploaded a Kookaburra and got an Encarta like experience zooming on different things.
4ndrewl
It looks pretty nice - reminds me of Dorling Kindersley books. But the graphics, whilst stylised, are pretty hit-and-miss. Great idea, just a bit too soon.
markusw
This is so good and so fun! :D
singingtoday
The images I upload are displayed with an incorrect aspect ratio.
Neat project though!
farmeroy
I kind of find this absolutely infuriating for reality, but super fun for diagrams of things like 'interdimensional subcutaneous engineering' or whatever scifi/fantasy word salad you want to throw at it
readitalready
This kind of thing would be great if we could have large local models sometime in the future.
Isharmla
Incredible AI visualization tool
dh1011
Very cool idea. Wish it could render faster.
wxw
So cool! Love the exploration into new interfaces.
ZeidJ
This would make an amazing educational tool
victorbjorklund
This is just epic. Really amazing.
gardenhedge
Game changer when the technology catches up
moralestapia
This is real nice, wow. Congratulations.
This very well could be a sneak-peek into how educational resources might look like in the future.
iJohnDoe
So cool! People being critical of it not being accurate, but from a technology concept it’s super awesome.
tristor
This is very cool, if a bit glitchy right now (probably thanks to HN popularity). I used to this to generate infographics of the rear subframe, diff carrier, and rear suspension of my car and to get detailed specifications on the bushings, suspension members, and other components. Most of the information matches what I already know, and could be really useful if trained specifically on manufacturer/dealer shop manuals to create interactive models of vehicles you can drill to and get part numbers and specifications for any component on a car.
CrzyLngPwd
The worst part of this sort of slop is the attention it squanders by being glacially slow.
In the age of such enormous computing power, this sort of thing is pure waste.
MS Encarta CDs were faster and more in-depth.
show comments
DonHopkins
This wins the internet.
I went from Cat Photos into History of Victorian Cat Photos With Props like Miniature Tea Sets And Velvet Chairs And Humorous Captions On Calling Cards In Visually Ironic Aristocratic Cooperplate Font The Victorian Meme Script With High Stakes Expectations Anchored In A World With Human Dignity As It Relates To Modern Memes in just a few clicks.
Oddly specific, but that was exactly what I needed to see today.
I love this approach, very Diamond Age. I uploaded a picture of my cat and learned a lot about coat genetics. I think it fixated on his coloration.
The demo on the main page suggests "Paris Travel Overview / Visiting Notre Dame", so I tried a couple of cities and places that I've actually been to.
If often identifies some of the points of interest correctly, but their spacial location relative to each other is completely insane. Like, not even close to reality.
I just asked it to create a torque spec diagram of the suspension for my car, a subject I'm pretty familiar with. It amazingly drew everything correctly, displayed the correct torque figures and allowed me to click on individual components to zoom in further, providing more specs.
Genuinely one of the most impressive demos I've tried in a long time. I was able to use it almost like a living version of a classic illustrated Haynes workshop manual.
Cool project, but just a side thought I was having about how do people have resources and the money to make things like this and make it avl for public, I mean it's fair to say they have their own GPUs or if they are using api keys for gpt or Gemini with enterprise subsidized inference
But still coming from a frugal background I still cannot wrap my head around this
I don't know what everyone's complaining about, it seems perfectly accurate to me: https://flipbook.page/n/8e369afba8ea4563a739b58105c3c3d8
Heh, looks like it's not in its own training data: https://flipbook.page/n/d739a0bbc3664ba2aad331c90fef7406
Ah I was thinking this created the webpage itself, which I always thought was an interesting concept. Some future where the application is crafted in realtime to fulfill the needs of the user. Has anyone made something like this?
Sneed's Feed and Seed (Formerly Chuck's)
https://flipbook.page/n/4a5e1797903b478c876a35e64c6c57fe
Interesting idea, but just about everything is failing for me. Probably the HN hug of death happening.
wow - truly impressive.
Mac Neo, featuring 2x M4 quantum chips, solid state battery and graphene connector. https://flipbook.page/n/942776fea47c4274a9a4589134924ef5
This seems like an expensive product to subject to the HN hug of death.
The sample videos on the tweet are very very cool.
Unfortunately it didn’t really work for me, I’ll try it out in a few days when the traffic’s died down.
Fun. But just as flawed as any LLM.
https://flipbook.page/n/6bd29678afc04b799276f4d3d62cc18e
Text is still hard
screenshot: https://images2.imgbox.com/ff/84/j2FCxyrD_o.png - top right callout
Congrats on the launch! This is an amazing product. Something I would add is a panel with sources in case someone wants to have a deep dive in the information. My 2 cents: This could be transformed to a state of the art teacher kind of product
I applaud the idea and the technological part of that, but of course, as with all AI models, it produces utter slop once you go even a tiny bit outside of the learning data.
For example, here I asked for a mechanism of converting the circular motion to wing flapping, and it has no idea what to do at all.
https://flipbook.page/n/21f96ba33aa94852bc1f567bc5cd23bf
The core problem with using AI to learn is that if you don't know about the specific area (which you don't, that's why you are learning it), it can (and absolutely will) fake knowledge without you ever noticing it, and be utterly wrong, disseminating false info, and teaching you (or your kids) wrong world models.
It's not really a website streamed live at all... It's just asking a model to generate an image with some web searches plus its built-in (trained knowledge)
You can do this with any agentic coding model now... just ask OpenAI codex : generate me an image given this query: "the query" and you will get the same kind of outputs.
Very cool as a demo. I tried something information-dense, a poker pre-flop chart for a specific stack depth (40BB BTN vs UTG rfi) and it was about what I expected. It doesn't even resemble a poker chart and there's no salvageable information as far as I can tell. Not really something this should be able to do though.
https://flipbook.page/n/d48526ab345c4880a3b2171785508f52
I typed in the address of my childhood home, and breathed a sigh of relief when it showed a random home with solar panels and 'clean modern sustainable living' which my childhood home was not. Even added solar panels.
General design was correct, and it included the name of a town just nearby.
Not a surprising result, but made me reflect on what a weird world we now live in.
I love this because it articulates so well a precise vision of a world I don’t want to live in.
This is built from the collective works of all humans throughout history who have strived to make infographics, illustrations, and communicate knowledge - with 0 actual credit or reference to them (or financial compensation, if they’re still alive).
Instead, who is making money from this? Google, as providers of the model - and maybe the founders of this product, if they ever choose to monetize it somehow.
I’m not even going to get into how the results it produces have just enough “insight” to appear valid but the moment you inspect it up close, it’s completely wrong in most details, and replete with ornamentation that doesn’t actually add to meaning - a Potemkin village of knowledge - because the common answer to this criticism around here is “just wait 6 months bro the models will definitely solve all those problems”.
(some of the things I’ve tried for reference)
https://flipbook.page/n/56fed5ac8e164467b1d6151a6d5068ae
https://flipbook.page/n/deeb4d846d1a44738aa70d8973fc5765
https://flipbook.page/n/335fb5d4c4d8428d82e8a43fc4f7a4e8
We are not better off investing billions of dollars in computers doing this over paying humans to write and illustrate and make cultural artefacts. We are not better off putting this in the hands of kids rather than meaningfully designed resources & curriculum designed by humans.
What are we even doing.
That’s the biggest problem with the current wave of AI tooling - it’s so easy to make a cool demo all while completely missing the point of what actually is good for human flourishing.
It's like "GPT is all you need for the backend" [1] on steroids
[1] https://news.ycombinator.com/item?id=34503418
Beyond the training boundary. Seems a propos for "an infinite visual browser". I'm just not clear on how we get to the beyond part of it.
https://flipbook.page/n/f8982ddfd3ef4cbcb2ad8449d7d049b6
Didn't quite nail the labeling of each piece correctly for a small form factor PC build:
https://flipbook.page/n/12267bbfdeb043c3aa477337950b2b71
- M2 is labeled as GPU
- GPU is labeled as M.2 and RAM?
- RAM is labeled as GPU
- Random plant inside the case?
- This is also not a typical layout for a SFF PC
Great demo, interesting transitions and UI, but the model / generated information is definitely not correct.
Interesting idea and cool demo.
For this to really be practical you'd need a way to run networks many times faster and more efficiently than today's GPUs. This is too slow to work even with cloud GPUs powering it.
Maybe someday.
I like the visual style it produces, great for educational material. What is it called?
This is fun. I started with "all hail the glow cloud" and now I'm clicking to wander around Nightvale. It's not exactly suprrising that it knows all of the lore, but it paints a pretty cohesive picture...
It's pretty cool. I created a beautiful isometric illustration of home garden, which is worthy of being featured in a real book or magazine. I really like the isometric view to explain things, and the color palette is consistent and pleasant.
Very cool project ! I fear this might have a pretty high hallucination potential (with current models) the deeper you dig into the base image/context and clicking on potentially unrelated elements in the image. Nevertheless, love the idea.
This is not really working for me at all, the second images always look near identical to the first with some minor changes. Maybe my prompts are the issue? Anyone have some good prompts?
Couldn't get it to load (probably getting hammered right now) but the concept is interesting. Feels like one of those things where the tech needs to get 10x cheaper before it actually makes sense as a product.
Great idea and execution!
This is one of the more unique ideas i've encountered in a long time
Fun. Uploaded a Kookaburra and got an Encarta like experience zooming on different things.
It looks pretty nice - reminds me of Dorling Kindersley books. But the graphics, whilst stylised, are pretty hit-and-miss. Great idea, just a bit too soon.
This is so good and so fun! :D
The images I upload are displayed with an incorrect aspect ratio.
Neat project though!
I kind of find this absolutely infuriating for reality, but super fun for diagrams of things like 'interdimensional subcutaneous engineering' or whatever scifi/fantasy word salad you want to throw at it
This kind of thing would be great if we could have large local models sometime in the future.
Incredible AI visualization tool
Very cool idea. Wish it could render faster.
So cool! Love the exploration into new interfaces.
This would make an amazing educational tool
This is just epic. Really amazing.
Game changer when the technology catches up
This is real nice, wow. Congratulations.
This very well could be a sneak-peek into how educational resources might look like in the future.
So cool! People being critical of it not being accurate, but from a technology concept it’s super awesome.
This is very cool, if a bit glitchy right now (probably thanks to HN popularity). I used to this to generate infographics of the rear subframe, diff carrier, and rear suspension of my car and to get detailed specifications on the bushings, suspension members, and other components. Most of the information matches what I already know, and could be really useful if trained specifically on manufacturer/dealer shop manuals to create interactive models of vehicles you can drill to and get part numbers and specifications for any component on a car.
The worst part of this sort of slop is the attention it squanders by being glacially slow.
In the age of such enormous computing power, this sort of thing is pure waste.
MS Encarta CDs were faster and more in-depth.
This wins the internet.
I went from Cat Photos into History of Victorian Cat Photos With Props like Miniature Tea Sets And Velvet Chairs And Humorous Captions On Calling Cards In Visually Ironic Aristocratic Cooperplate Font The Victorian Meme Script With High Stakes Expectations Anchored In A World With Human Dignity As It Relates To Modern Memes in just a few clicks.
Oddly specific, but that was exactly what I needed to see today.