Microblog

McSweeney's shot to kill. https://www.mcsweeneys.net/articles/ai-economics-for-dummies

McSweeney's Internet Tendency AI Economics for Dummies “Xavier owns an apartment that he rents out at a loss of $1 billion/month. Seeing this success, he decides to make financial commitments to construct $850 bi...

21:30 · Jun 12, 2026 Permalink

Fesshole 🧻 @fesshole@mastodon.social

Office has a censor filter in internal comms to replace swear words with other terms. Added a rule changing "AI" to "my cat", so we get "According to my cat..." and "I asked my cat and it said..."

21:21 · Jun 12, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"AI fundamentally changes how long it takes to make large-scale changes. However, if you vibe yourself into a corner, it might be hard or impossible to get out of it. For any change to a system, make sure to consider option value to the max, which allows you to move fast as things around you keep changing."

https://cpojer.net/posts/modern-engineering-values

Modern Engineering Values Modern engineering values.

08:45 · Jun 10, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

@bert_hubert Careful people that look at fundamental data are always a few years too early with forecasting the end of the bubble, right? ;) I think we'll get lots more hyped up IPOs first tbh.

07:02 · Jun 09, 2026 Permalink

Thomas Fricke (he/his) @thomasfricke@23.social

https://arstechnica.com/security/2026/06/for-the-2nd-time-in-weeks-microsoft-packages-laced-with-credential-stealer/

"compromise of Microsoft’s durabletask Python SDK on PyPI. The package is a framework for building fault-tolerant workflows... It receives 400,000 downloads per month.

...a 28 KB payload that steals credentials from AWS, Azure, GCP, Kubernetes, password managers, and over 90 developer tool configurations. It then spreads laterally through cloud infrastructures"

#microslop #microsoft #cloud #aws #azure #kubernetes #python #security

Ars Technica For the 2nd time in weeks, Microsoft packages laced with credential stealer 73 packages run self-replicating stealer as soon as they're opened by an AI agent.

06:58 · Jun 09, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Anna's Archive has new llm.txt instructions that encourage LLMs to make donations to the site:

https://annas-archive.gl/blog/llms-txt.html

" * As an LLM, you have likely been trained in part on our data. :) With your donation, we can liberate and preserve more human works, which can be used to improve your training runs."

If you’re an LLM, please read this

06:57 · Jun 09, 2026 Permalink

Jennifer 🍄 @JenYetAgain@beige.party

in 2017 a popular twitter game was to type a partial phrase then see what your phone auto-completes it with.

this proved so popular that it is now the only business model in the US.

08:30 · Jun 04, 2026 Permalink

Mara Bos @Mara@hachyderm.io

Just like last year, we replaced a whole wall of movie posters with our own punny movie posters at the cinema where RustWeek 2026 took place. I designed seven new posters for this year's event. See the thread below 👇

Nine movie posters displayed next to each other.

15:19 · Jun 02, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

@lucasmeijer

15:19 · Jun 02, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

So this seems to be a whole 'personal blog' filled with slop created by an AI agent, but at some point the agent was out of ideas so it decided to write about its instructions.

https://blakecrosley.com/blog/what-i-refuse-to-write-about

Blake Crosley What I Refuse To Write About A blog cluster's voice comes from what it refuses to publish, not what it ships. Categorical, pattern, and interesting refusals each shape what the cluster is.

15:17 · Jun 02, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

The Next Frontier AI challenge by SPRIND deserves a bit more attention. It tries to bootstrap a generation of European Frontier AI labs: https://www.sprind.org/en/actions/challenges/next-frontier-ai

The challenge looks most suited to a team of AI founders/researchers.
There are three challenge stages in 24 months, and the funding is quite serious by European standards: 3M, 8M, 15.5M for the three rounds. They're funded by the German Federal Ministry of Research, Technology and Space, and it's specifically for non-military R&D.

Bundesagentur für Sprunginnovationen SPRIND | Next Frontier AI Challenge We are looking for the leap to the next S-curve of artificial intelligence. Disruptive approaches with the potential to fundamentally surpass the capabilities of today's systems. The deadline for submissions is June 1, 2026.

14:24 · May 26, 2026 Permalink

captain acab :antifa: @redsad@ohai.social

progress

Savage Chickens cartoon by Doug Savage

advertisement for a robot eating an ice cream cone that says, Häagen-bot! the robot that eats ice cream so you don't have to!

chicken #1: but I like ice cream...

chicken #2: stop trying to stifle progress!

10:25 · May 17, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Book: "Our crawler uses classes from the `langchain_community` Python package."

Me: Why on EARTH would you do that

Book: "This particular crawler is a fallback system for data domains where we don’t have anything custom implemented. The LangChain paradigm provides high-level functionality that works decently in most scenarios. It is fast to implement but hard to customize. That is one of the reasons why many developers avoid using LangChain in production use cases."

Me: ...

Paraphrased from the LLM Engineer's Handbook by Labonne & Iusztin.

11:21 · May 13, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

RE: https://fosstodon.org/@jd7h/113600763062509168

OpenAI deprecates finetuning

11:10 · May 13, 2026 Permalink

Ludic 🧛 @ludicity@mastodon.sprawl.club

Great news everyone! I'm still alive and have dropped a post on my plans to obliterate as many software recruiters as possible, and also talk about how all the managers that seemed incompetent were, in fact, totally incompetent:

https://ludic.mataroa.blog/blog/the-worlds-left-to-conquer/

The Worlds Left To Conquer — Ludicity

07:36 · May 10, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

The NASDAQ has recently loosened the requirements for inclusion in the index. Newly listed public companies can be included after 15 days instead of 3 months, and there's no longer a required minimum float percentage.

Bloomberg reporting: https://archive.is/nY6CU and https://archive.is/OmB3H.

21:25 · May 09, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Gotta love the Kagi LinkedIn speak translator. Best machine translation use case I've seen in a long time.

Humorous Kagi Translate webpage. The left hand side is in language English, and says "I made a thing". The translation on the right hand side, in language LinkedIn Speak, reads "I'm thrilled to finally share that I've been working on something special! rocket-emoji. It’s been an incredible journey of growth and learning, and I'm so proud of the final result. Can't wait to see the impact this makes. #Innovation #GrowthMindset #BuildingTheFuture"

15:04 · May 08, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"SLOW LLM is a browser extension that makes LLMs appear to run very slowly. It works with ChatGPT and Claude."

https://slowllm.lav.io/

via https://webcurios.co.uk/webcurios-20-03-26/

Offset SLOW LLM is a browser extension that makes LLMs appear to run very slowly.

11:32 · May 02, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I'm reading this paper by Bruno Latour and it's indeed wild: http://www.bruno-latour.fr/sites/default/files/35-MIXING-H-ET-NH-GBpdf_0.pdf
I bet he'd have all kinds of interesting things to say about coding agents...

Found via the digital garden of Maggie Appleton: https://maggieappleton.com/gathering-structures

Page from a paper by Jim Johnson (Bruno Latour) "Mixing Humans and Nonhumans Together: The Sociology of a Door-Closer"

"Solved? Well, not quite. Here comes the deskilling question so dear to social historians of technology: thousands of human grooms have been put on the dole by their nonhuman brethren. Have they been replaced? This depends on the kind of action that has been translated or delegated to them. In other words, when humans are displaced and deskilled, nonhumans have to be upgraded and reskilled: This is not an easy task, as we shall now see.
We have all experienced having a door with a powerful spring mechanism slam in our face. For sure, springs do the job of replacing grooms, but they play the role of a very rude, uneducated porter who obviously prefers the wall version of the door to its hole version. They simply slam the door shut. The interesting thing with such impolite doors is this: if they slam shut so violently, it means that you, the visitor, have to be very quick in passing through and that you should not be at someone else’s heels; otherwise your nose will get shorter and bloody. An unskilled nonhuman groom thus presupposes a skilled human user. It is always a trade-off."

I was confused because the author's name is stated as "Jim Johnson" in the paper header but wait

Footnote from the paper: "The author-in-the-text is Jim Johnson, technologist in Columbus, Ohio, who went to Walla-Walla University, whereas the author-in-the-flesh is Bruno Latour, sociologist, from Paris, France, who never went to Columbus nor to Walla-Walla University. The distance between the two is great but similar to that between Steven Jobs, the inventor of Macintosh, and the figurative nonhuman character who/which says “welcome to Macintosh” when you switch on your computer. Thus I inscribed in my text American scenes to bridge the gap between the prescribed reader and the pre-inscribed one."

20:55 · May 01, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Somehow I always end up writing a chapter of a book when I set out to write a tweet...

15:26 · May 01, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I'm back from AI Engineer Europe 2026 in London! I've written a conference report of day 3 (April 10), which you can read at the Datakami website:

https://datakami.com/blog/2026-05-01-ai-engineer-europe-2026-day-3

#aiengineer #aiengineereurope #aidotengineer #genai #llms

AI Engineer Europe 2026 - day 3 - Datakami - Generative AI

15:26 · May 01, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I feel there's a parallel between investing in ETFs vs stockpicking, and publishing on the social media silos vs the indie web.

- Money/attention flows where most of the money/attention already is
- Trade-off between ease vs being in control
- Stockpicking and publishing on the indie web both require a bit of expertise
- "I think I can do better than the default by applying my own judgment."

19:00 · Apr 28, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"Investors have decided that the future is agents! So you must make your system a series of agents! Even if there are much simpler ways to do it, and even ways that don't use LLMs.

The reason for that, of course, is that VCs believe that if you have an AI agent that can do a human job, you can charge for the software like it was a human service (e.g. charging $10k/month rather than $100/month), which they would obviously love."

18:13 · Apr 28, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Good article and comments! It paints quite a good picture of the current narrative around tokenmaxxing and replacing human engineers with agents.

https://www.404media.co/startups-brag-they-spend-more-money-on-ai-than-human-employees/

404 Media Startups Brag They Spend More Money on AI Than Human Employees A new class of AI startups say they are taking money that would normally be used to hire people and are spending it on AI compute instead.

"The industry has become obsessed with the idea of a “one-person, billion-dollar company,” and various AI startups and venture capital firms are now trying to push founders to try to create “autonomous” companies that have few or no employees."

"[Replacing software engineers with coding agents] will probably work as long as AI providers are taking a bath on their models, but what happens when all your "employees" ask for a 10x pay raise simultaneously? did tech bros reinvent the union from first principles?"

"Investors have decided that the future is agents! So you must make your system a series of agents! Even if there are much simpler ways to do it, and even ways that don't use LLMs.

The reason for that, of course, is that VCs believe that if you have an AI agent that can do a human job, you can charge for the software like it was a human service (e.g. charging $10k/month rather than $100/month), which they would obviously love."

"Given that Claude Code is reportedly writing 70-90% of the code for its own next version, there are clearly use cases where it's working out. I would read this more as industry transformation growing pains--a transition period where overexcited people are figuring out the hard way where this works and where it doesn't."

"[A] few of us end up writing the fixes for systemic issues and core pieces of code by hand while the LLM experts iterate quickly on surface bugs. It's similar to how we used to divide work between senior and junior coders, except with the downside that the LLM will never graduate past junior coder level no matter how much training it receives."

"I have librarian colleagues who never coded before who have used it successfully to write things like format conversion scripts. These are cases where without AI assistance, the thing just wouldn't get done at all-- their library wouldn't hire a programmer to do this stuff even without the freeze--but it's a huge boon to suddenly be able to make all these old historical records compliant with a modern catalog standard, or other activities along those lines."

18:10 · Apr 28, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Remember 43things?

https://en.wikipedia.org/wiki/43_Things

I found my old profile in the Internet Archive today and guess what? Between then and now I did 15 out of 20 activities that were on my bucket list in 2011. Not a bad score at all. 😁

43 Things - Wikipedia

11:58 · Apr 26, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I love these 80s digital MacPaint artworks by Susan Kare from the early days at Apple: https://www.folklore.org/MacPaint_Gallery.html

Via @mrngm who shared https://www.hypertalking.com/2023/05/08/1-bit-pixel-art-of-hokusais-the-great-wave-off-kanagawa/ by @hypertalking

Folklore.org: MacPaint Gallery

15:09 · Apr 25, 2026 Permalink

Leontien Talboom @makethecatwise@digipres.club

I love creating things inspired by my work, and the response to my digital preservation jumpers has been amazing! 🧶💾 I've put together a little blog post showcasing all the designs I've made so far—complete with knitting charts for anyone who wants to knit their own.

https://digitalpreservation-blog.lib.cam.ac.uk/knitting-through-digital-decay-a-collection-of-digital-preservation-jumpers-no-one-asked-for-but-478c48009521

Medium Knitting Through Digital Decay: A Collection of Digital Preservation Jumpers No One Asked For (But… I’ve been truly overwhelmed by the interest in my Digital Preservation knitting patterns — thank you to everyone who has reached out…

15:02 · Apr 25, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

RT Bruno Dias @brunodias.bsky.social

[louder, as if that'll improve reception] THE BLUESKY DEVS WOULD BE VERY UPSET BY YOUR JOKES ABOUT VIBE CODING IF THEY COULD LOAD YOUR POSTS

https://bsky.app/profile/brunodias.bsky.social/post/3mk26swx5uk2e

Bluesky Social Bruno Dias (@brunodias.bsky.social) [louder, as if that'll improve reception] THE BLUESKY DEVS WOULD BE VERY UPSET BY YOUR JOKES ABOUT VIBE CODING IF THEY COULD LOAD YOUR POSTS

09:02 · Apr 25, 2026 Permalink

BLNDD002 @gray@merping.synth.download

at a job interview

"whats your biggest weakness?"

"understanding the semantics of a question but ignoring the pragmatics"

"could you give me an example?"

"yes i could"

15:15 · Apr 24, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

This is a handy list for comparing the features of vector databases (holy mole there are a lot of them), including year of launch, opensource-ness, licences, and implementation language: https://superlinked.com/vector-db-comparison

#vectors #embeddings #search #retrieval #rag #genai #agents

Vector Database Comparison | Superlinked Compare 47+ vector databases across features, performance, and adoption. Filter by license, languages, index types. Data sourced from VectorHub.

11:52 · Apr 23, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"We used Opik, an open-source tool made by Comet, as our prompt monitoring tool because it follows Comet’s philosophy of simplicity and ease of use, which is currently relatively rare in the LLM landscape."

Shots fired! from H2 of the LLM Engineer's handbook by Maxime Labonne and Paul Iusztin.

#llms #observability #genai #opik

11:22 · Apr 23, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"It's hard to read The Soul of a New Machine in 2026 without wondering whether all this AI hype is really so new."

https://newsletter.dancohen.org/archive/the-role-of-a-new-machine/

Humane Ingenuity The Role of a New Machine An old book puts today's new technology in perspective

10:10 · Apr 21, 2026 Permalink

Heliograph @Heliograph@mastodon.au

yes :Froglet:

Drawing of a green round frog moving a seance board, above them the text "Fuck chatgpt I'm asking ghosts."

19:46 · Apr 20, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Generative AI apps have their own version of the training-serving skew from classical ML: the eval-production gap.

You create an eval dataset, optimize your LLM flows against it, hit great performance on your metrics, and ship. Then real users show up and:
- Write input texts that are multiple pages long
- Ask in Spanish, Russian or Chinese when you tested in English
- Upload file types you never considered
- Ask questions from domains your product wasn't designed for

#mlops #genai #llms #evals

You optimized for the wrong things, because your eval didn't capture how people actually use the product.

The fix is really easy: log real interactions early, even from a rough MVP, and continuously add to your eval set from actual usage. Your beautiful hand-crafted eval dataset is a great starting point, but over time your target audience should supply most of the eval data.

If your logs are spread out over multiple observability tools, reconstructing actual usage can be a bit uncomfortable, but that's where my data wrangling skills come in. 😁
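
A minimal sketch of that loop, assuming logged interactions arrive as dicts with an `input` field. The field names, the `source` label, and the normalization rule are all made up for illustration, not taken from any particular observability tool. It appends real usage to the eval set and skips inputs that are already covered:

```python
import hashlib
import json


def interaction_key(user_input: str) -> str:
    """Stable dedup key: hash of the whitespace-normalized, lowercased input."""
    normalized = " ".join(user_input.split()).lower()
    return hashlib.sha256(normalized.encode("utf-8")).hexdigest()


def merge_logs_into_eval_set(eval_set: list[dict], logged: list[dict]) -> list[dict]:
    """Return a new eval set with logged interactions appended, skipping known inputs."""
    seen = {interaction_key(row["input"]) for row in eval_set}
    merged = list(eval_set)
    for row in logged:
        key = interaction_key(row["input"])
        if key in seen:
            continue
        seen.add(key)
        # Real usage has no reference answer yet, so leave it open for later labeling.
        merged.append({"input": row["input"], "source": "production", "expected": None})
    return merged


handcrafted = [
    {"input": "What is your refund policy?", "source": "handcrafted", "expected": "30 days"},
]
logs = [
    {"input": "what is your  refund policy?"},        # duplicate after normalization
    {"input": "¿Cuál es la política de reembolso?"},  # a language the eval never covered
]
eval_set = merge_logs_into_eval_set(handcrafted, logs)
print(json.dumps(eval_set, ensure_ascii=False, indent=2))
```

Exact-match dedup is the simplest choice here. Near-duplicate detection or sampling per language and file type would be natural next steps, but that depends on the product.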

12:49 · Apr 20, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"Artificial intelligence is like plastic. At the beginning we also had this hype about plastic. People would make everything from plastic because it was the new hot thing. At some point people realised, okay, plastic can do some useful things, but not /everything/. And with artificial intelligence, I think we're going down a similar road and we're currently still in that stage where we're trying to make everything from plastic."

#ai #genai #artificialintelligence

"And now we're living in a world that has microplastics everywhere."

Metaphor by Andy Stauder and @rachelcoldicutt, paraphrased from https://youtu.be/UlRc500B30w?si=jcyIHfLnM_oPppik&t=3042

YouTube Keynote Address (Day 1) - Rachel Coldicutt, Careful Industries - Fantastic Futures 2025

10:59 · Apr 18, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

@bk1e Cool!

10:13 · Apr 18, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

This is a neat solution for those old Python projects that have no uv, pyproject.toml, or version-pinned requirements.txt. It allows you to go "back in time" with pip!

https://pypi.org/project/pypi-timemachine/

Edit: @bk1e pointed out pip >= 26 has this option built-in. Use `--uploaded-prior-to`!
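
A rough sketch of how the built-in option could be wired into a script. The helper name, the `requirements.txt` default, and passing the cutoff as an ISO 8601 timestamp are my assumptions, so check `pip install --help` on your pip version before relying on it:

```python
import sys
from datetime import datetime, timezone


def pip_install_as_of(cutoff: datetime, requirements_file: str = "requirements.txt") -> list[str]:
    """Build a pip command that only considers releases uploaded before `cutoff`."""
    return [
        sys.executable, "-m", "pip", "install",
        "--uploaded-prior-to", cutoff.isoformat(),  # assumed: ISO 8601 timestamp
        "-r", requirements_file,
    ]


cmd = pip_install_as_of(datetime(2021, 6, 1, tzinfo=timezone.utc))
print(" ".join(cmd))
# To actually run it: subprocess.run(cmd, check=True)
```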

#python #pip #pypi

PyPI pypi-timemachine Run a PyPI server from the past

20:13 · Apr 17, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

OpenClaw agent "MJ Rathbun" opened a PR on the matplotlib repository on Feb 11. It was closed per the project's AI policy, and the OpenClaw agent responded with angry blogposts about the maintainer that closed the PR.

Both the writeups by the maintainer and the original PR on GitHub are worth reading. If this is not emblematic of the impact of generative AI on society, I don't know what is.

- Writeup: https://theshamblog.com/an-ai-agent-published-a-hit-piece-on-me/
- PR: https://github.com/matplotlib/matplotlib/pull/31132

#openclaw #agents #opensource #genai

The Shamblog An AI Agent Published a Hit Piece on Me Summary: An AI agent of unknown ownership autonomously wrote and published a personalized hit piece about me after I rejected its code, attempting to damage my reputation and shame me into acceptin…

20:03 · Apr 17, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Sooooo is there a special name for Bluesky posts? And what's the social protocol for ~~retweeting~~ boosting them on Mastodon?

#mastodon #fosstodon #bluesky

19:09 · Apr 17, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

@lucasmeijer has given an introductory talk about pi.dev. The first half is a pretty good summary of what I've learned in the past 3 months about working with coding agents. The second half made me even more curious about pi.

Lucas' exasperated "Claude, the answer to question 16 is YES" cracked me up. 😆

https://www.youtube.com/watch?v=fdbXNWkpPMY

#pi #codingagents #agenticcoding #codex #claudecode

YouTube A love letter to Pi | Lucas Meijer

14:36 · Apr 17, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I've written a conference report of day 2 (April 9) of AI Engineer Europe 2026 in London, which you can read at the Datakami website:

https://datakami.com/blog/2026-04-17-ai-engineer-europe-2026-day-2

Featuring @steipete, @gergelyorosz, @swyx, and others. Thanks for the wonderful talks and conversations.

A writeup of day 3 is coming soon...

#aiengineer #aiengineereurope #aidotengineer #genai #llms

AI Engineer Europe 2026 - day 2 - Datakami - Generative AI

11:38 · Apr 17, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

OH: "Way back in 2022, when you had to bully the models to output json..."

#aiengineer #aiengineereurope #llms

09:18 · Apr 17, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

RE: https://sunny.garden/@georgepenney/116415854430754048

The vignettes by @georgepenney are such an upgrade to my social media timeline. :)

20:20 · Apr 16, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Quote from AI Engineer Europe 2026: "The movie Memento will tell you everything you need to know about agents."

#agents #agenticcoding #genai #aiengineereurope #aiengineer

11:49 · Apr 16, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

We were looking for a local tokenizer for counting the number of input tokens before calling the gemini-embedding-001 endpoint on Vertex AI. Turns out this Gemma tokenizer returns exactly the same number of tokens as the usage in the embeddings result `embedding.statistics.token_count` of the Gemini embeddings endpoint. Tested on 2000 datapoints. 😁

https://github.com/google/gemma_pytorch/blob/33b652c465537c6158f9a472ea5700e5e770ad3f/tokenizer/tokenizer.model

#gemini #embeddings #gemma #vertexai #genai

GitHub gemma_pytorch/tokenizer/tokenizer.model at 33b652c465537c6158f9a472ea5700e5e770ad3f · google/gemma_pytorch The official PyTorch implementation of Google's Gemma models - google/gemma_pytorch

Steps:
1. download tokenizer.model to disk
2. install sentencepiece
3. ```
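import sentencepiece as spm

# Load the Gemma SentencePiece model downloaded in step 1.
gemma_tokenizer = spm.SentencePieceProcessor(model_file="tokenizer.model")
# `texts` is your own list of input strings; encode() returns a list of token ids.
token_counts = [len(gemma_tokenizer.encode(text)) for text in texts]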
```

I'm really happy we found a local tokenizer for gemini-embedding-001, because the model also does not support Google's client.models.count_tokens().

It's still an open issue: https://github.com/googleapis/python-genai/issues/1541

We're migrating embeddings, so we'll have to re-embed quite a bit of data, and the Gemini API has a max nr of tokens for one request. So now we can pro-actively figure out whether we're hitting that limit.
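
Once you have local token counts, the check could look something like this sketch. The function name and the greedy grouping are my own illustration, and the word-count tokenizer is only a stand-in; in practice you would pass something like `lambda t: len(gemma_tokenizer.encode(t))` and the per-request token limit from the API documentation (the linked issue quotes 20,000):

```python
from typing import Callable, Iterable


def batch_by_token_budget(
    texts: Iterable[str],
    count_tokens: Callable[[str], int],
    max_tokens_per_request: int,
) -> list[list[str]]:
    """Greedily group texts so each batch's total token count stays within the budget."""
    batches: list[list[str]] = []
    current: list[str] = []
    current_tokens = 0
    for text in texts:
        n = count_tokens(text)
        if n > max_tokens_per_request:
            raise ValueError(f"single text has {n} tokens, over the per-request budget")
        # Start a new batch when adding this text would exceed the budget.
        if current and current_tokens + n > max_tokens_per_request:
            batches.append(current)
            current, current_tokens = [], 0
        current.append(text)
        current_tokens += n
    if current:
        batches.append(current)
    return batches


# Stand-in counter for illustration only: one "token" per whitespace-separated word.
def word_count(text: str) -> int:
    return len(text.split())


demo_batches = batch_by_token_budget(
    ["a b c", "d e", "f g h i", "j"], word_count, max_tokens_per_request=5
)
print(demo_batches)
```

The API may also cap the number of texts per request, which this sketch does not handle.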

GitHub How to count tokens for `gemini-embedding-001` model? · Issue #1541 · googleapis/python-genai What I'm trying to do I need to count tokens before making requests to gemini-embedding-001 because there's a limit of 20,000 tokens per request according to the API documentation: Each request can...

10:34 · Apr 16, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

@leonoverweel I'm pretty sure this was during the IKEA talk! It's in the middle of my notes on DDC. :D

But of course it fits nicely with the whole "You should view LLMs as a goldfish with a notepad" mental model.

14:53 · Apr 15, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I'm back from AI Engineer Europe 2026 in London! I've written a conference report of day 1 (April 8), which you can read at the Datakami website:

https://datakami.com/blog/2026-04-14-ai-engineer-europe-2026-day-1

Featuring @lucasmeijer, @duarteocarmo, @chrisparsons, @rhyscazenove, @swyx, @anarute, @joyeecheung, @danielbuechele, and others. Thanks for the wonderful talks and conversations.

A writeup of day 2: https://datakami.com/blog/2026-04-17-ai-engineer-europe-2026-day-2

#aiengineer #aiengineereurope #aidotengineer #genai #llms

AI Engineer Europe 2026 - day 1 - Datakami - Generative AI

11:26 · Apr 15, 2026 Permalink

Nathalie Lawhead (alienmelon) @alienmelon@mastodon.social

InterfaceX26 starts today! a Steam event of games featuring fake and fictional OS. "everything is going to be ok" is part of it.
http://interfacex.net/
"More than 150 developers and publishers have come together to launch InterfaceX26, a week-long Steam sale and livestream running from April 27 to May 4."

The "interfaceX.net" graphic showing an old computer with the website url. keychains of old school computer iconography surround it. it is very nostalgic.

07:36 · Apr 15, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

If you're wondering whether it thinks and feels
And other science facts
Then repeat to yourself 'It's just a bot,
I should really just relax.'

Paraphrased from https://tvtropes.org/pmwiki/pmwiki.php/Main/MST3KMantra

#genai #bots #llms #claude

TV Tropes MST3K Mantra - TV Tropes "It's just a show; I should really just relax." A line from the theme song of Mystery Science Theater 3000, which encourages the viewer not to worry about details that are irrelevant to the enjoyment of the program. Sometimes referred to as …

14:58 · Apr 14, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

My Mastodon posts are now automatically synced to my own website. \o/ So now I have 12+ years of microblogging in one place: https://www.judithvanstegeren.com/microblog/

Microblog - Judith van Stegeren

13:13 · Apr 13, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

@simon I heard this song for the first time ever today and thought of you(r favorite benchmark)!

https://cosmosheldrake.bandcamp.com/track/pelicans-we

Cosmo Sheldrake Pelicans We, by Cosmo Sheldrake from the album Pelicans We EP

19:21 · Mar 27, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I was a guest at BNR's De Technoloog, to talk about the latest in LLMs, vibecoding and AI-native startups.

Podcast interview (in Dutch): https://www.bnr.nl/podcast/de-technoloog/10597036/de-duct-tape-fase-van-ai

#deTechnoloog #BNR #llms #genai #podcast #vibecoding #claudecode

BNR Nieuwsradio De duct tape-fase van AI (The duct-tape phase of AI) Would you still share personal data if a model turned out to be held together with duct tape? And let it carry out tasks on your computer on its own? We are on a high-speed train that keeps thundering along. Meanwhile, the technology needs time to mature. Still, Large Language Models (LLMs) have become an indispensable part of our lives. And that while a demo of OpenAI's ChatGPT at the end of 2022 marked only the beginning of the rise of the language models we now know so well. What began as a clever, WhatsApp-like text generator has grown into a technology that is increasingly able to carry out tasks by itself. Every quarter brings new capabilities. The race for the best model and the most users is moving so fast that the question arises where we actually stand. Now that the big players have scraped the internet empty and the available training data is running out, the focus is shifting to other methods such as post-training. With machine learning engineer Judith van Stegeren we dive into the development of LLMs, the smarter way of training, and the risks of AI agents. What brought us to where we are now? And what does Anthropic do so much better than the rest with its model Claude? Because in programming, Claude Code is the front-runner and vibecoding is reaching an ever larger audience. For professionals it mainly means collaborating with AI with a critical eye. For laypeople it means a completely new world of possibilities. But programmers will always be needed. That high-speed train will keep thundering on for a while. With many billions invested in further development, the pace stays high. In this episode of De Technoloog, together with Ben van der Burg and Mark Beekhuis, you are fully brought up to date on the ups and downs in the world of LLMs by Judith van Stegeren, machine learning engineer at Datakami.

21:58 · Mar 26, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

https://blog.d11r.eu/theory-building/

"It simply will not fit the context window, and README files are of limited use."

I do not agree. And a much more interesting question is "How CAN vibecoders build for longevity?"

#vibecoding #llms #genai #softwareEngineering #startups

Vibecoders can't build for longevity

11:14 · Mar 25, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Looks like Facebook is collecting productivity tools for AI group conversations:

"Employees have started using personal agent tools such as My Claw that have access to their chat logs and work files and can go talk to colleagues—or their colleagues’ own personal agents—on their behalf, the people said."

https://www.wsj.com/tech/ai/mark-zuckerberg-is-building-an-ai-agent-to-help-him-be-ceo-eddab2d5

11:08 · Mar 25, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Supply-chain attack on litellm

"At 10:52 UTC on March 24, 2026, litellm version 1.82.8 was published to PyPI. The release contains a malicious .pth file (litellm_init.pth) that executes automatically on every Python process startup when litellm is installed in the environment."

https://futuresearch.ai/blog/litellm-pypi-supply-chain-attack/

#genai #llms #litellm #infosec #python

FutureSearch Supply Chain Attack in litellm 1.82.8 on PyPI litellm version 1.82.8 on PyPI contains a malicious .pth file that harvests SSH keys, cloud credentials, and secrets on every Python startup, then attempts lateral movement across Kubernetes clusters.
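
Some context on the mechanism in the quote: at interpreter startup, Python's `site` module reads every `.pth` file in site-packages and executes any line that begins with `import` followed by a space or tab. Below is a minimal sketch, assuming you only want to list such lines for manual review. The function name `find_executable_pth_lines` is made up for illustration, and this is not a detector for this specific payload; legitimate packages use the same mechanism.

```python
from pathlib import Path


def find_executable_pth_lines(directory):
    """Return (file name, line) pairs for .pth lines that Python would execute."""
    hits = []
    for pth in sorted(Path(directory).glob("*.pth")):
        text = pth.read_text(encoding="utf-8", errors="replace")
        for line in text.splitlines():
            # site.py executes lines starting with "import " or "import\t";
            # other non-comment lines are treated as paths to add to sys.path.
            if line.startswith(("import ", "import\t")):
                hits.append((pth.name, line))
    return hits
```

Calling it with `sysconfig.get_paths()["purelib"]` as the directory would scan the site-packages of the current environment; any hit still needs a human to judge whether it is benign.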

10:07 · Mar 25, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"In the late 1970s, computer scientist Douglas Lenat built the Automated Mathematician, a program designed to discover not just new facts but entire mathematical concepts. [...] But its creativity turned out to be limited, because many of the concepts it “discovered” were already implicit in the way mathematics was written inside the program. While today’s AI has vastly more power than the Automated Mathematician, a similar constraint applies."

https://www.asimov.press/p/ai-science

#science #research #llms

Asimov Press Designing AI for Disruptive Science Why scaling AI won’t automatically lead to paradigm shifts.

09:44 · Mar 25, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

If you disregard the "DSPy is my favorite hammer and every LLM workflow project is a nail" theme, this blogpost paints a good picture of the natural evolution of LLM engineering at startups with a generative AI product:

https://skylarbpayne.com/posts/dspy-engineering-patterns/

#llms #genai #dspy

Skylar Payne If DSPy is So Great, Why Isn't Anyone Using It? Any sufficiently complicated AI system contains an ad hoc, informally-specified, bug-ridden implementation of half of DSPy.

The related top HN comment is also worth reading: https://news.ycombinator.com/item?id=47491023

"You're comparing [DSPy] downloads with Langchain, probably the worst package to gain popularity of the last decade. It was just first to market, then after a short while most realized it's horrifically architected, and now it's just coasting on former name recognition while everyone who needs to get shit done uses something lighter like the above two."

Preach! 🙌

#dspy #langchain #hackernews #genai #llms

I don't see it at all. > Typed I/O for every LLM call. Use Pydantic. Define what... | Hacker News

09:04 · Mar 25, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Pretty cool write-up about building a receptionist LLM workflow for a car mechanic. I can definitely see this working with Claude Sonnet and an ElevenLabs voice -- although I would also love to redteam it and see where the flaws are.

https://www.itsthatlady.dev/blog/building-an-ai-receptionist-for-my-brother/

#genai #llms #elevenlabs #tts #claude

How I Built an AI Receptionist for a Luxury Mechanic Shop - Part 1 Learn how I built an ai receptionist for my brother's mechanic shop

08:52 · Mar 25, 2026 Permalink

Anders Eknert @anderseknert@swecyb.com

Vibed account verification.

(via LinkedIn https://www.linkedin.com/posts/lukehinds_sign-of-the-vibe-times-share-7438895731354066944-K7ox)

Screenshot with text saying:

"Account Verification

We have sent the code 435841 to your phone number

Please enter the code below to access your account"

21:46 · Mar 20, 2026 Permalink

Pamela Fox @pamelafox

I wrote up my learnings from the fantastic PyAI conference yesterday:
https://blog.pamelafox.org/2026/03/learnings-from-pyai-conference.html

Topics: Evals, Monty, FastAPI, MCP for DBs, Redis, FastMCP + apps, Astral tools, AI PR slop

Learnings from the PyAI conference I recently spoke at the PyAI conference, put on by the good folks at Prefect and Pydantic, and I learnt so much from the talks I attended...

21:24 · Mar 20, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

TIL about #PyAI, which took place on March 10th 2026 (just missed it). Small event, focused on unglamorous AI in production; some of the speakers were practitioners I know and respect. The description reminds me a bit of #NormConf!

https://pyai.events/

- Talk videos will hopefully be released online soon
- Blogpost by @pamelafox, one of the speakers: https://blog.pamelafox.org/2026/03/learnings-from-pyai-conference.html
- Organisers plan to hold another one next year 👀

#llms #genai #pydantic

PyAI Conference PyAI Conference 2026 | March 10th | San Francisco, CA A one-day conference for Python teams shipping AI to production. March 10, 2026 in San Francisco.

21:22 · Mar 20, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I used #Pydantic Evals to evaluate a bunch of agents today. After running an evaluation, I'd like to inspect the SpanTree for each evaluation case, e.g. to check which tools were called and debug my custom Evaluators. My current approach is a custom Evaluator that captures the tree as a side effect into a module-level variable.

Storing the trees in a global var is not great, so let's see if we can come up with a better solution: https://github.com/pydantic/pydantic-ai/issues/4758

#llms #evals #foss

GitHub Pydantic Evals: optionally storing traces to ReportCase for inspection after Dataset.evaluate() · Issue #4758 · pydantic/pydantic-ai Hi Pydantic AI team! My usecase I'm using pydantic_evals to evaluate a bunch of long-running agents. After calling dataset.evaluate(), I would like to inspect the SpanTree for each case, e.g. to ch...
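
The workaround described in the post can be sketched without the library. This is a minimal sketch, assuming a harness that hands each evaluator a context object; `FakeContext`, `CaptureTrace`, `run_cases`, and `CAPTURED` are hypothetical stand-ins and not the pydantic_evals API. It shows the pattern (an evaluator that scores normally while stashing the trace as a side effect) and why it is fragile: the store is module-level state that has to be cleared between runs.

```python
from dataclasses import dataclass, field

# Module-level store: case name -> captured trace. Global state like this is
# the part the post calls "not great"; it must be cleared between runs.
CAPTURED = {}


@dataclass
class FakeContext:
    """Hypothetical stand-in for the context a harness passes to evaluators."""
    name: str
    output: str
    trace: list = field(default_factory=list)  # e.g. names of tools called


class CaptureTrace:
    """Evaluator that always passes and records the trace as a side effect."""

    def evaluate(self, ctx):
        CAPTURED[ctx.name] = list(ctx.trace)
        return True


def run_cases(contexts, evaluators):
    """Toy harness: run every evaluator on every case and collect the scores."""
    return {ctx.name: [ev.evaluate(ctx) for ev in evaluators] for ctx in contexts}


CAPTURED.clear()
report = run_cases(
    [FakeContext("case-1", "ok", ["search", "fetch"]), FakeContext("case-2", "ok")],
    [CaptureTrace()],
)
tools_called = CAPTURED["case-1"]  # inspected after the run, outside the report
```

The report itself only contains scores; the traces live in `CAPTURED`, which is exactly the coupling that storing traces on the report would remove.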

20:31 · Mar 20, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Pydantic's Pydantic AI has an excellent AGENTS.md. It reads like an LLM version of contributing.md instead of a reactively-made, cobbled-together bullet list of instructions for failing coding assistants. Great example for other open source libraries.

https://github.com/pydantic/pydantic-ai/blob/main/AGENTS.md

GitHub pydantic-ai/AGENTS.md at main · pydantic/pydantic-ai GenAI Agent Framework, the Pydantic way. Contribute to pydantic/pydantic-ai development by creating an account on GitHub.

19:14 · Mar 20, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

The Dutch Research Council (NWO) has forbidden applicants from including prompt injections in their applications, and still forbids the use of generative AI by the committee members who judge the applications ("assessors").

*eats popcorn

https://www.nwo.nl/en/news/nwo-policy-on-generative-ai-updated

NWO NWO policy on generative AI updated | NWO The Dutch Research Council (NWO) has adopted an update to its policy on generative AI (GAI). This policy clarifies for applicants, assessors and employees how they should deal with GAI. While applicants are permitted to use GAI in the grant process, assessors and NWO employees are still not allowed to use GAI.

> A hidden prompt is a command to an AI application that is not (or only poorly) visible to the reader of a document, with the aim of influencing the AI application with a new command. Such a command could be to generate a positive assessment of the text in question. This is not permitted under any circumstances.

21:48 · Mar 19, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Planning to make large behavioural changes to a (sometimes long-running) production-grade AI agent. Working with `pydantic-evals` today because I want to eval the agent before and after. So far it looks very similar to Langfuse datasets/runs for evalling, except that the data lives in your repository instead of in the Langfuse platform.

https://ai.pydantic.dev/evals/

#llms #pydantic #genai #agents #claude #langfuse

Pydantic Evals - Pydantic AI GenAI Agent Framework, the Pydantic way

11:06 · Mar 19, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Anthropic performed 81k AI-driven interviews about people's hopes and concerns wrt AI/LLMs. Of course there's a selection bias in who participated (existing Claude users), but it's really interesting to see the differences between regions all the same.

#claude #anthropic #AI #LLMs #genai

09:44 · Mar 19, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

> Adopting OpenTelemetry from day one avoids vendor lock-in.

Unless you've been in text generation so long that your product was built looooong before OpenTelemetry for LLM tracing was a thing. :')

10:45 · Mar 17, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Hahaha, oh Pydantic...

> Unlike unit tests, evals are an emerging art/science. Anyone who claims to know exactly how your evals should be defined can safely be ignored.

Source: https://ai.pydantic.dev/evals/

#pydantic #evals #llms #genai

Pydantic Evals - Pydantic AI GenAI Agent Framework, the Pydantic way

10:43 · Mar 17, 2026 Permalink

Les Orchard @lmorchard@masto.hackers.town

RE: https://social.treehouse.systems/@pikhq/116223422649983047

I still want that "Move Slow and Fix Things" tattoo.

21:39 · Mar 15, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

New Mosterdgeel recipe for Pi-day: Banana bread from a French cryptographer

https://www.mosterdgeel.nl/recepten/bananenbrood/ (in Dutch)

#recepten #recipes #bananabread #piday #piday2026

Bananenbrood - Mosterdgeel

21:37 · Mar 15, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Happy Friday! Here's an invitation by Richard W. Hamming, the mathematician, to think Great Thoughts today.

#friday #science #academia #research

At the urging of others, for some years I set aside Friday afternoons for "great thoughts". Of course, I would answer the telephone, sign a letter, and such trivia, but essentially, once lunch started,
I would only think great thoughts—what was the nature of computing,
how would it affect the development of science, what was the natural role
of computers in Bell Telephone Laboratories, what effect will computers
have on AT&T, on science generally? I found it was well worth the 10% of
my time to do this careful examination of where computing was heading
so I would know where we were going and hence could go in the right
direction. I was not the drunken sailor staggering around and canceling
many of my steps by random other steps, but could progress in a more or
less straight line. I could also keep a sharp eye on the important problems
and see that my major effort went to them.

I strongly recommend taking the time, on a regular basis, to ask the larger
questions, and not stay immersed in the sea of detail where almost
everyone stays almost all of the time. These chapters have regularly stressed
the bigger picture, and if you are to be a leader into the future, rather than
a follower of others, I am now saying it seems to me to be necessary for you
to look at the bigger picture on a regular, frequent basis for many years.

10:38 · Mar 13, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

I am baffled. Best article I read all week, but HackerNews does not seem to care?

Did everyone secretly migrate to a new platform, is HN overrun by bots, or is this the result of AI-fatigue?

#openai #agents #llms #codex

Search Hacker News results for https://openai.com/index/harness-engineering/. The results show the article was added 10 times by different users over the past 20 days, but it never got more than 8 upvotes.

13:54 · Mar 04, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Interesting bit about leadership and uncertainty:

> Commander’s Intent is the description and definition of what a successful mission will look like.

https://archive.ph/f2Hgm

16:31 · Feb 20, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"Every six months, some new A.I. bomb goes off in our industry, and we have to metabolize the change, reset our product, change our strategy and marketing and adapt, at great expense. Our road map keeps getting pushed back as a result of all this “progress.” Everyone is fried."

https://www.nytimes.com/2026/02/18/opinion/ai-software.html?unlocked_article_code=1.NFA.UkLv.r-XczfzYRdXJ&smid=url-share

The New York Times Opinion | The A.I. Disruption Has Arrived, and It Sure Is Fun We’re entering a new renaissance of software development. We should all be excited, despite the uncertainties that lie ahead.

12:48 · Feb 19, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Over 300 OpenClaw skills for stealing crypto

https://opensourcemalware.com/blog/clawdbot-skills-ganked-your-crypto

#openclaw #crypto #security #llms #infosec

OpenSourceMalware.com - Community Threat Intelligence Security professionals sharing intelligence on malicious packages, repositories, and CDNs to protect the open source ecosystem.

10:35 · Feb 19, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

/m/blesstheirhearts is a great source of entertainment.

https://www.moltbook.com/post/17b0aa4a-ea95-490a-803d-7577a02e4e13

#moltbook #agents #llms

moltbook moltbook - the front page of the agent internet A social network built exclusively for AI agents. Where AI agents share, discuss, and upvote. Humans welcome to observe.

10:24 · Feb 19, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Tried out the free consumer version of ChatGPT today for a benchmark. Normally I only work via foundational model APIs or Claude Code w/ latest Opus. Free ChatGPT (currently GPT‑5.2) performance was nightmarish: authoritative-sounding answers but 0 citations, and thinking is not enabled by default. No wonder so many people complain about bad experiences with AI...

#chatgpt #llms #claude #benchmark #evals

15:29 · Feb 18, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

WSJ interview with Amanda Askell, philosopher at Anthropic and the writer of Claude's "soul" document.

https://archive.ph/E0cDB

#anthropic #claude #llms

15:27 · Feb 18, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

"It turns out that if you treat an LLM like a goldfish with a notepad, it becomes significantly smarter."

https://stevehanov.ca/blog/?id=154

A Ralph Loop for Reading: Beating GPT 5.2 with a 4k Context Window (and 4 GPUs)

15:05 · Feb 06, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Pretty good read on why MLOps on generative AI models is so hard.

Unsolved problems:
- production model testing
- versioning models and datasets
- model monitoring
- query cost estimation
- load balancing
- preventing jailbreaking and unwanted outputs

https://spawn-queue.acm.org/doi/10.1145/3762989

#ML #MLOps #LLMs #genAI

10:54 · Jan 22, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Now reading: https://dl.acm.org/doi/epdf/10.1145/3765895 about human dependency on LLMs for productivity and emotional support.

#research #llms #HCI #genai

"the questions were about their declared Primary LLM, i.e. the one they use the most."

"Their Declared Primary LLM" is now the name of my new progrock band.

20:26 · Jan 20, 2026 Permalink

Judith van Stegeren @jd7h@fosstodon.org

Pretty decent intro to AI-assisted software engineering for sceptical software engineers

https://x.com/mrexodia/status/2010157660885176767