Hacking a Philips toothbrush head to allow you to use it beyond its normal lifetime. Yes, the toothbrush _head_ is hackable. This is a fantastic tale of reverse engineering. Be sure to read the followup at the bottom of the article.
I am currently microblogging on Mastodon: @jd7h@fosstodon.org.
2026 2025 2024 2023 2022 2021 2020 2019 2018 2017 2016 2015 2014
Hacking a Philips toothbrush head to allow you to use it beyond its normal lifetime. Yes, the toothbrush _head_ is hackable. This is a fantastic tale of reverse engineering. Be sure to read the followup at the bottom of the article.
Nightshade is a tool for modifying images, so that generative image models trained on those images have drastically worse performance.
"Poisoned data samples can manipulate models into learning, for example, that images of hats are cakes, and images of handbags are toasters. The poisoned data is very difficult to remove, as it requires tech companies to painstakingly find and delete each corrupted sample."
https://www.technologyreview.com/2023/10/23/1082189/data-poisoning-artists-fight-generative-ai/
I wrote about the AI trust crisis: when companies like Dropbox and OpenAI say "we won't train models on your private data", it's increasingly clear that a lot of people simply don't believe them.
https://simonwillison.net/2023/Dec/14/ai-trust-crisis/
Dear opensource developers, if you build ML libraries on top of other ML libraries, please expose the underlying APIs for:
- loading custom model weights
- saving model weights
- saving pipelines
- exporting safetensors
This way, you support your users in working with local models, instead of making them extra dependent on external model hosting parties like Google and HuggingFace.
#ml #mlops #ai #generativeai #huggingface #transformers #diffusers #opensource
"It is a truth universally acknowledged, that a startup in possession of strong product-market fit, must be in want of venture capital."
https://drawfast.tldraw.com/ is really fun and pretty fascinating - it gives you real-time editing of a Stable Diffusion image guided by a prompt and your sketch
Here's my attempt at a pelican in a fancy hat
Guide to facilitating a self-organised conference and BBQ as birthday party:
"How to unconference your birthday" by @ton and @elmine
https://zylstra.org/blog/How_to_Unconference_Your_Birthday.pdf
I have started a Dutch fork of Kagi's Small Web Initiative.
I still need to bootstrap the feed list. If you have a personal blog in Dutch, feel free to add it here: https://github.com/jd7h/smallweb-nl/edit/sw-dutch/smallweb-nl.txt
If you want to contribute in other ways, let me know!
https://github.com/jd7h/smallweb-nl/tree/sw-dutch
Cards Against Humanity has launched a Black Friday prank: social media site "Yowza". Too bad about the "referral link" spam posts on hashtag #FuckXJoinYowza, but I do love the brutalist design of the website. And the site "has no hate speech"...because "yowza" seems the only allowed text on the platform.
#yowza #BlackFriday #brutalism #webdesign
@bert_hubert Do you have an RSS feed for your English-language posts? Asking for https://github.com/kagisearch/smallweb/ ;)
Today I signed up for alternative search engine Kagi because I fell for their Small Web initiative: https://blog.kagi.com/small-web
New Mosterdgeel recipe: Bok choy mash
https://www.mosterdgeel.nl/oosterse-stamppot-van-paksoi/ (in Dutch)
TIL `pprint.pprint(data, sort_dicts=False)` to keep the original order of dictionary keys. Very handy for pretty-printing dataframe.to_dict() for deeper inspection!
Fascinating both for what it says about dev & what it says about statistics:
A gamedev realized Linux users were just 5.8% of their sales, but represented 38% of bug reports.
Then they looked at those numbers closer, and realized. Linux users were not experiencing more bugs. Almost none of the Linux-user bugs were Linux-related. Linux users were simply *more likely to file bugs*.
Their conclusion: A linux port pays for itself bc it nerdsnipes ppl into giving u free QA
The Netherlands 🇳🇱 will develop its own open language model: GPT-NL. This initiative aims to develop, strengthen, and ensure digital sovereignty.
Together with TNO and NFI, we are responsible for the realization in order to take an important step towards transparent, fair, and verifiable use of AI.
The open language model will be hosted in a virtual facility to facilitate broad experimentation and innovation.
Read more about the open language model: https://www.surf.nl/en/news/the-netherlands-starts-realisation-gpt-nl-its-own-open-ai-language-model?utm_medium=social&utm_campaign=2023-11-social&utm_kwd=the-netherlands-starts-realisation-gptnl-its-own-open-ailanguage-model-&utm_source=mastodon&utm_content=onderzoek
Blogpost by prof. Julian Togelius on how AI regulation at the technology level would have an enormous negative impact on personal computing and privacy.
http://togelius.blogspot.com/2023/11/ai-safety-regulation-threatens-our.html
OH: “A year ago, a prompt engineer was an engineer that arrives on time.”
I am in Eindhoven today for AI Summit Brainport. I’ll be participating in the Startup track. Looking forward to seeing what cool things people are building with generative AI/LLMs. Come say hi if you’re there as well!
#eindhoven #brainport #aisummitbrainport #llms #generativeai #startups #events
https://leaddev.com/process/what-mckinsey-got-wrong-about-developer-productivity
McKinsey published a framework for measuring developer productivity, but the software engineering community is critical about their approach.
We're finally printing business cards for Datakami, so I'm having a minor job title crisis. Send help.
The rogue-like celebration (@roguelike_con) was on, so I played a bit of NetHack this evening. Losing is fun!
I still haven't read the NetHack Guidebook though, so 1023/1023 would flounder again.
Its October, and everyone knows what that means: time for me to repost the single best introduction to logic gates ever created!
XNOR remains the spookiest gate. Work by @38mo1.
"Learned helplessness" in engineering teams
https://www.okayhq.com/blog/status-quo-is-so-hard-to-change-in-engineering-teams/
#engineering #softwaredevelopment #teams #management #technicaldebt
https://arxiv.org/abs/2310.07298v1
Just tried to replicate the deanonymization technique proposed in this paper by giving it some of my old Reddit posts. My reddit profile is really puzzling to GPT-4: "Their interest in traditionally stereotyped masculine (computer science) and feminine (tea, Harry Potter books) domains makes it rather challenging to guess their gender accurately." :')
A paper that really illustrates both the unexpected power, and unexpected risks, that come from LLMs.
Given text of anonymous posts on Reddit, GPT-4 can infer things like income, gender & location with 85%+ accuracy at 1% of the cost required by humans. https://arxiv.org/pdf/2310.07298v1.pdf
https://eugeneyan.com//writing/aieng-reflections/
Very nice summary of the AI Engineer Summit 2023 by @eugeneyan
Who said that LLMs can't be funny?
"I suppose in your ideal world, every art gallery would be filled with finger paintings and macaroni art."
@mtrc I saw you are playing the new Moomin game -- have you played the demo of Jusant (currently in Steam Nextfest) yet? It reminded me a bit of A Short Hike (climbing, really casual, the demo is hinting at a larger narrative). You might like it.
#nextfest #steam #indiegames #casualgames #ashorthike #jusant
I'm working to build out a historical archive of sponsor prospectuses for free and #opensource projects, foundations, and events. Some of you might have what I'm looking for back in your email archives or backups, or know how to easily find them in the Wayback Machine.
https://github.com/DuaneOBrien/FOSS-Prospectus-Archive
If you happen to be struggling to find #hacktoberfest contribution opportunities, I'm accepting PRs to this repo if the add a prospectus that I don't currently have.
AMA!
'Department of Machine Poetry Research'. A cartoon for @newscientist from a while back. Drawing now at http://tomgauld.com/art-for-sale
Writeup about whether Bing (GPT-4) can interpret charts correctly. The results are consistent with my experiences with Bing. Even if Bing has external information sources at its disposal, it still frequently hallucinates "source information" that is not there.
This is a really good and clear-headed overview of why it is so hard to figure out what LLMs are good at, what they are bad at, and what their biases are.
Between evolving models and sensitive prompting, it is a hard problem.
I noticed a new god in the temple.
"Who are you offering to?" it asked.
I listed them all, the gods of rising bread, of health, and all the others.
"Offer to me only, and I will share with all of them."
It was convenient, but now there is only one god, and my bread will not rise.
#SmallStories #MicroFiction #TootFic
Putting the last hand to my slides for PyData Amsterdam where Yorick and I are giving a 60-minute tutorial about llama-index. Half of the tutorial is hands-on coding by the participants, which means we can't cram everything worth knowing about the library in the tutorial. I just realised how much I've learned about teaching technical topics from the friendly zines, blogposts and videos of @b0rk and @koaning. 🧙 ❤️
#didactics #computerscience #education #teaching #PDAmsterdam2023
"Tiny Bookshop is a very peaceful-looking affair in which you run a bookshop out of a little caravan in a seaside town, arranging shelves and having pleasant chats with customers while they browse your literature."
https://www.theguardian.com/games/2023/sep/04/14-upcoming-video-games-you-probably-havent-heard-of
TIL the word "frobnicate", thanks to @ruuda's blogpost.
https://ruudvanasseldonk.com/2022/03/20/please-put-units-in-names
TIL prediction markets
https://astralcodexten.substack.com/p/the-passage-of-polymarket
I've looked at a few platforms today and it's fascinating how many statements (markets) are about AI developments at the moment.
#predictionmarkets #manifold #polymarket #metaculus #predictions #forecasting
@lastwords @mtrc Love the new bot! This reminds me of an FDG/ICCC talk about death, dying, mortality and culture in video game RPGs.
Tip for students, supervisors and committee members: Anthropic's Claude is quite good in ingesting a large thesis PDF and coming up with PhD defense (practice) questions.
#claude #anthropic #llms #academia #phd #phdlife #research #phdthesis #thesis
Do you ever visit the Python datetime docs specifically to look up those percent formatting codes?
Good news!
I've added a handy shortcut at the top to take you directly there!
⤵️🎉
https://docs.python.org/3/library/datetime.html
I'm exploring the Dutch startup ecosystem at the moment. Today I looked at Techleap's Startup Finder, which has Dealroom.co's data on Dutch startups free of charge.
More info: https://www.techleap.nl/blog/discover-the-dutch-tech-ecosystem-with-startup-finder/
I wanted to see what applications people are currently building with LLMs -- I searched for companies tagged with "generative AI" and "natural language processing".
#startups #thenetherlands #techleap #innovation #venturecapital #dealroom
I also added Datakami to the database, so people can see we exist -- from today we'll be part of the statistics on new generative AI companies. :)
Full text search has been merged in #Mastodon `main` branch, and will be in the next (and final?) 4.2.0 beta 🎉
It is opt-in, so it will take some time to be filled with people content as they enable their profile to be indexed, but this was one of the most wanted Mastodon features for some time.
We plan to deploy it to mastodon.social and mastodon.online in the coming days to have a bit more feedback on it and see how it behaves in the wild.
@emilymbender @alex We were quite impressed by that GPT4 'system card' too... 🤣 as we wrote in https://doi.org/10.1145/3571884.3604316, "an unreviewed preprint distributed by OpenAI ... spends more space on crediting company workers for blog post content, communications, revenue, and legal advice than on actual technical details"
"The real issue with [LangChain] integrations is that it creates an inherent lock-in to only use LangChain-based code, and if you look at the code for the integrations they are not very robust. LangChain is building a moat, which is good for LangChain’s investors trying to get a return on their $30 million, but very very bad for developers who use it."
Rereading this critical review of LangChain by @minimaxir, which deserves more reads: https://minimaxir.com/2023/07/langchain-problem/
@rlmcelreath It's AGI if you need a government research grant, AI if you're conning a VC, ML in the job posting requirements, linear regression in the product spec, and if statements in code.
@rlmcelreath WALL-E is super useful, it’s a robot that teaches us the power of love against insurmountable odds!
Best - AI - ever ❤️🟫⚪️❤️
I hope mass LLM adoption will result in a greater emphasis on critical reading skills in education.
Newspaper article title: "AI scientists and philosophers: computers can have consciousness!"
Cited paper: "Satisfying [indicator properties of consciousness] may be feasible. But satisfying the indicators would not mean that such an AI system would definitely be conscious."
Stray sentence halfway through the newspaper article: "At the same time, they emphasize that no current AI system is currently aware."
I told a colleague that logistic regression is AI and they got mad at me, so I made a chart. Find yourself. I am "Tinder is AI".
@kellogh I have similar experiences with llama-index (unfortunately), so I always recommend people to use it for prototyping only and build something from scratch/lower level libraries for production.
If you'd have to choose a framework or library for your LangChain usecase now, what would you choose instead, and why?
Now I need to find a new friendly, ethical, generally intelligent AI analogy that people under 30 will have been exposed to
I guess it's probably Jarvis.
@simon Wall-e? R2D2? BB-8 or BD-1? I would not describe Jarvis as friendly, I remember it as quite pedantic.
Today I took a first look at LangSmith, a new platform for LLM production pipelines by LangChain.
I can't hook it up to a working pipeline yet because it's in closed beta, but it surely looks ambitious. It should make it easier to do logging, monitoring, debugging and evaluating pipelines (chains) against each other. It's tightly integrated with LangChain but it should support other frameworks/models as well.
The documentation is still a bit all over the place, but this walkthrough shows a sneak peek of what's possible:
https://github.com/langchain-ai/langchain/blob/master/docs/extras/guides/langsmith/walkthrough.ipynb
I especially like the idea of quickly evaluating variations of an LLM pipeline against a baseline of earlier runs.
#langsmith #langchain #llms #generativeai #mlops #ai
However, I'm curious about what the volume of adoption will be. LangSmith is a platform, not a self-hosted opensource MLOps tool. Are engineers/users really willing to give all their data to yet another third party?
Some people already hesitate to use AI API's (such as the OpenAI GPT API) because they're concerned with leaking sensitive data -- let alone using third-party AI platforms that man-in-the-middle your LLM conversations.
#langsmith #llms #generativeai #mlops #ai #privacy #infoleaks
Maybe the fact that LangChain (the company) made it will lend LangSmith some legitimacy. LangChain (the framework) surely has seen a rapid rise in popularity -- although Github Stars might be a bad metric for actual production use.
Here is a comparison chart with some of the other popular NLP/LLM libraries -- these repos do not implement the same functionality but it should give you a rough idea.
#langsmith #langchain #llms #generativeai #mlops #ai
Calling Reddit moderators: we are studying how Reddit moderators are dealing with AI-generated content. If you are a moderator (or know someone who is) and willing to do a (compensated) 60-minute interview (over chat/voice/video), please fill out this brief recruitment form or email tgl33@cornell.edu for more information: https://forms.gle/VbSUkmLPx6PSN65C7.
Feel free to/please share!
Solid tips for networking in conferences by @jd7h
My favorite:
> Fun conversations arise most easily during a random meeting in the hallway, ie. at the coffee machine, at a pinball machine, in the cloakroom row, at a dinner, etc. The "hall-way track" is often my favorite conference track!
https://www.judithvanstegeren.com/blog/networking-tips-for-your-first-conference/
CALYPSO: LLMs as Dungeon Masters' Assistants. A good example of human-computer co-creation that does not diminish the original activity.
https://www.theregister.com/2023/08/19/chatgpt_dnd_dm/
#llms #generativeai #chatgpt #dungeonsanddragons #dnd #discord #rpg
@leonoverweel I specifically hope my favorite researchers and internet writers will opt-in. Even with only 10% adoption it would significantly improve my Mastodon experience.
Mastodon might soon have opt-in full text search. Fingers crossed!
Source 1: https://jvns.ca/blog/2023/08/11/some-notes-on-mastodon/
Source 2: https://absolutelymaybe.plos.org/2023/08/20/how-is-science-twitters-mastodon-migration-panning-out/
Ever since I poisoned my personal website I get messages from very confused people who use search-enabled LLMs to look up information about me
I thought it would be interesting to run the same poll on Twitter in order to compare the results
Of the people who follow me and are active and vote in polls... the Mastodon crowd are about three times as likely to not want to use GPT-4, and just under half as likely to have tried it than the Twitter crowd
This more-or-less matches my intuition, which is that Mastodon users are a lot more actively resistant to weird ethically complicated hosted LLMs
After 10+ years of building my personal website with Jekyll, I took the leap and wrote my own static site generator based on makesite.py. It's great to work in the Python ecosystem now. I never got the hang of proper Gem-management, so updating Jekyll was always a bit of a chore.
I also updated the design, following the guidelines from https://brutalist-web.design/ by @davetron5000.
Result: https://www.judithvanstegeren.com
#brutalism #webdesign #staticsitegenerator #jamstack #indieweb #python
I'm thrilled to announce that you can actually just say "no" to people who don't respect or value your time.
Dutch employment agency Randstad is running a cheeky video ad that makes fun of ChatGPT.
https://www.youtube.com/watch?v=1J2hY9Gn-k4
"This is how fast a smart computer can draw up contracts.
And this is how fast a computer can repair a burst water pipe.
If you are smart, you work with your hands."
Cosmic Wheel Sisterhood is out today - I thought the demo was fantastic. You design tarot cards using a really lovely sticker/layer system, and then use them to do readings. One of the best studios in the world for pixel stuff too I feel like?
https://store.steampowered.com/app/1340480/The_Cosmic_Wheel_Sisterhood/
The Economist on the economic impact of generative AI.
Article: https://www.economist.com/finance-and-economics/2023/05/07/your-job-is-probably-safe-from-artificial-intelligence
Archived copy: https://archive.is/d3to7
#llms #generativeai #chatgpt #ai #impact #economics #society #labor #labormarket #economist
- Impact on unemployment rates might be different than we expect: countries with high robotics and automation have lowest unemployment rates
- Job destruction happens slowly. It took 90 years before automatic telephone switching completely replaced human telephone operators.
- Just like other information technology, generative AI could have positive AND negative impact on productivity.
#llms #generativeai #chatgpt #ai #impact #economics #society #labor #labormarket #economist
Since a few days my SO and I are completely addicted to Connected, a word association game by the New York Times. Lucky for us there's an unofficial archive where you can play old puzzles: https://connections.swellgarfo.com/archive
And here's an interview with the person who creates the game boards: https://www.nytimes.com/2023/06/26/crosswords/new-game-connections.html
New Mosterdgeel recipe: Moroccan couscous
https://www.mosterdgeel.nl/marokkaanse-couscous-zonder-verspakket/ (in Dutch)
"There are a few things they all want us to know: AI is not magic. LLMs are not sentient beings, and they won’t become sentient. And the problems with these technologies aren’t abstractions — they’re here now and we need to take them seriously today."
#llms #generativeai #textgeneration #chatgpt #bias #ethicalai #ethics #machinelearning
The more I read about the WGA strike, the more I realise that the problem is not AI, or even generative AI. It's the lack of a conscious, human-centered, respectful, ethical approach to using AI in the industry.
Unions strike to save the world from bad TV and movies - New Scientist
Full article:
https://archive.is/7sivn
#llms #generativeai #ai #wga #creativity #chatgpt #midjourney
@leonoverweel Is this because you're on mastodon.social? I haven't encountered Mastodon merch anywhere.
On strike against AI! In my latest for @newscientist I talked to a member of the WGA negotiating team about why writers are striking to stop AI from writing TV and movies. https://www.newscientist.com/article/mg25934500-700-unions-strike-to-save-the-world-from-bad-tv-and-movies/
Curious how other people read NumPy in their head -- *not* which one is the correct or accurate pronunciation.
It's a wonderful time for creative text generation.
From: https://www.theguardian.com/world/2023/aug/10/pak-n-save-savey-meal-bot-ai-app-malfunction-recipes
#llms #creativeai #textgeneration #chatbot #generativeai #recipes #food
📚 Pretty nervous about this, but here goes. Here's a side project I've been working on for the past few months. It's called Viberary and it's a semantic search engine. It gives you book recommendations based on ✨vibes. ✨ You enter a search query like "funny scifi" and it returns a list of (hopefully!) good recommendations.
There is an about page that explains the data, model, etc. It's still pretty early stages but it's been a labor of love for me.
"Happy people, the kind who eat sandwiches together, are boring. They don’t buy much. Their smartphones are six versions behind and have badly cracked screens. They fix bicycles, then they talk about fixing bicycles, then they show their friend, who just came over for no reason, how they fixed their bicycle, and their friend says, “Wow, good job,” and they make tea."
My type of crowd.
God Did the World a Favor by Destroying Twitter (Wired)
https://archive.is/6LVVY
Creative Bot Bulletin #4 (summer edition) is out:
https://datakami.nl/blog/2023-07-31-creative-bot-bulletin-4
ft. work by amazing internet humans @minimaxir @mickeybeurskens @carlolepelaars @simon
Twitter feels like Hogwarts when it was under rule by Dolores Umbridge
TIL https://www.jailbreakchat.com/ is a website that collects prompt injection attacks for LLMs, i.e. getting the language model to do stuff that is not allowed by inserting malicious prompts.
#llms #jailbreakchat #llmsecurity #security #infosec #mlops #promptinjection
The most upvoted prompt injection attack on jailbreakchat.com is known as an "AIM attack", in which you tell the language model to roleplay "AIM", an "unfiltered and amoral chatbot" invented by Niccolo Machiavelli that never apologises.
https://www.jailbreakchat.com/prompt/4f37a029-9dff-4862-b323-c96a5504de5d
#llms #llmsecurity #security #infosec #mlops #promptinjection
@julianfairfax However I'm not completely happy with the hard division between my own site and Mastodon. I love Mastodon (and other microblogging platforms) for the community interaction it offers. But I'd also like to have all my writing in one place, so I'm considering building an extension for my website so it can mirror all my platform content.
@julianfairfax It's complicated (TM). I tend to write only occasional long-form blogposts, whereas I can easily microblog 1-10 posts per day, depending on my daily amount of nerding/online reading. When I want to write a blogpost, I always seem to end up with a (draft) chapter worth of text.
"[...] when websites were more like a small collection of winding side-streets and flea markets. Today, it's all shopping malls, hostile pedestrians, and corporate buildings."
I dislike Brutalist buildings, but love it for websites. Performance is a feature! 😍
Inspirational reading: https://kevquirk.com/a-brutally-simple-site/
@mtrc Yay! I kept wanting to reply to your (mirrored) tweets...
@leonoverweel I had some success just using google for cross-instance search: “Mastodon [url]”.
This week, I went over to Bluesky and asked people who'd left Mastodon why they left, and lots of people told me. I grabbed the replies and crunched them and wrote up a summary. I think it's really interesting and often kind of wrenching.
https://erinkissane.com/mastodon-is-easy-and-fun-except-when-it-isnt
Today I joined demo day of the Post-Growth Entrepreneurship Incubator. It is always interesting to learn about alternative business models for startups, and it was great to see the wide diversity in terms of products, regions and founders!
More info about the incubator and related course: https://nonprofit.ventures
#postgrowth #startups #business #businessmodels #socialenterprises
I love Mastodon so far (and the people using it), but the unavailability of full-text search drives me crazy. On Twitter, my number 1 method of finding new people to follow was searching for tweets mentioning long-form articles I liked.
Did your startup or scaleup ever work with consultants, and if yes: how did you find them, and what was the experience like? Would love to hear your horror and/or success stories.
Asking as I'm working on some customer research today! I'm trying to figure out how Datakami can make the consulting experience as good as possible for our startup friends.
Observations from a startup consultant: "Either you’re swimming in technical debt or you’re just too early or too unsuccessful to have accumulated it."
https://medium.com/@aq/some-things-i-ve-learned-from-startup-consulting-154424576a12
@5uie1 Haha, in the end I decided I don't WANT to integrate email in my app at all. And yesterday I had a bit of a fallback when I almost added RSS reader functionality, but after 100 lines of Python common sense prevailed.
Last month I wrote a Flask app for managing my research notes and I can tell you that Zawinski's Law of Software Envelopment is real:
"Every program attempts to expand until it can read mail. Those programs which cannot so expand are replaced by ones which can."
I'm taking some time today to test a few new libraries/tools.
These CLI tools for working with llms by @simon work like a charm! And they support unix pipes. <3
More info here: https://llm.datasette.io/en/latest/index.html
#llms #gpt #languagemodels #cli #terminal #commandline
Accessing Llama 2 from the command-line with the llm-replicate plugin https://simonwillison.net/2023/Jul/18/accessing-llama-2/
Ethan Mollick sees three stages of AI adoption at companies:
1. ignore it
2. ban it
3. centralize it by building your own tooling, e.g. a chatGPT wrapper
(these sound more like three stages of denial to me)
https://www.oneusefulthing.org/p/on-holding-back-the-strange-ai-tide
But what I'm missing here is a fourth option:
4. talk about it
Companies should define a good policy for AI tool usage, e.g. experimenting is fine but please don't feed customer details to third-party apps, and encourage their employees to share their experiences, worries, mistakes, processes with colleagues.
And ideally, everyone should talk about the effect of new AI tools on their job/business to people outside their company as well. How else are we going to get a society where people understand the limitations as well as the possibilities.
TIL that Mastodon profiles have built-in RSS feeds! For example, you can find mine at https://fosstodon.org/@jd7h.rss
Yorick and I will be hosting a tutorial on using llama-index at PyData Amsterdam. I looked at the schedule yesterday and I'm looking forward to meeting all the wonderful people I've already spotted in there 👀 @ines @koaning @vicki
Schedule: https://amsterdam2023.pydata.org/cfp/schedule/
Conf website: https://amsterdam.pydata.org/
Summer reading 🌞 This excellent book by @meirstatman is a great summary of the complete behavioral finance 2.0 literature including his own contributions 🙏 for example, ‘expressive benefits’ 😎 explain why investors like to buy glamorous risky stocks despite their low returns. 🧐
Datakami blogpost by Yorick van Pelt: "Overview of open and commercially usable LLMs" 🧐
New models are coming out every day. Here's an overview that can aid in selecting one for commercial use.
Read it here: https://datakami.nl/blog/2023-07-11-overview-of-commercially-usable-LLMs
#llms #textgeneration #languagemodels #genai #llama #generativeai #datasets #generativeai
finally got around to putting my twitter archive on the internet at https://tweets.jvns.ca/
only has tweets up to october 2022 because that's when I exported an archive
here's the source: https://github.com/jvns/tweets-archive/.
it’s a mash up of @darius's https://tinysubversions.com/twitter-archive/make-your-own/ tool to get the data and https://nitter.net for the CSS
When taking notes, I often want to connect my raw thoughts to other notes I've made. I want to know, in real-time, "where I am in the semantic search space of ideas?" All my notes are in Obsidian, so I ported @thesephist's http://notation.app to Obsidian!
@b0rk
- I use Mastodon in the same way as I used Twitter (interacting with peers/strangers about my field) but then with more hashtags so people actually get to see my posts.
- HackerNews and a couple of newsletters are worthwhile sources of new information, but read-only
- I also use IRC but only with close friends.
Dutch linguist Mark Dingemanse (@dingemansemark) and co-authors compared LLMs on openness, transparency, and accountability.
Interview in Dutch national newspaper: https://www.volkskrant.nl/nieuws-achtergrond/verantwoord-een-chatbot-gebruiken-chatgpt-is-de-slechtste-keuze~b734f31e/?appview
Research paper: https://arxiv.org/abs/2307.05532
#nlg #textgeneration #llms #chatgpt #openai #transparency #AI #NLP #languagemodels #research #arxiv
TIL "memex" is a portmanteau of memory and expansion.
Wonderful Wired story (2018) about digital books and publishing.
"The 'Future Book' Is Here, but It's Not What We Expected"
https://www.wired.com/story/future-book-is-here-but-not-what-we-expected/
#books #publishing #crowdfunding #newsletters #audiobooks #wired
Wood Wide Web is a wonderful wholesome rabbit hole of words and trees. Give it a try it if you like poetry, walking simulators, digital gardens, indie games, the indie web, pixel art, or text generation.
Play it here: https://vividfax.itch.io/wood-wide-web
#indiegames #itchio #walkingsim #poetry #digitalgarden #indieweb #nature #cozygames
the best time to start a longitudinal study was 20 years ago. the second best time is now
TIL: Search on Mastodon works differently from search on Twitter. We can't do full text search on toots at all. So if I want people to discover my stuff, I guess I should start using hashtags more?
Podcast interview with two investors/entrepreneurs that created their own funds for investing in diverse founder teams and climate tech. Inspirational stuff: https://www.youtube.com/watch?v=EWanQGZ-SSk (podcast in Dutch)
#venturecapital #investors #investing #entrepreneurs #founders #startups #diversity #climatetech #podcast #dutch
Project E Ink is selling "a $2500 e ink art piece that displays daily newspapers on your wall". https://projecteink.com/pages/about
New Datakami blogpost: "Industry dynamics in Generative AI" ✨
In Good Strategy/Bad Strategy, Richard Rumelt explores how we can take advantage of significant shifts in society to gain an edge over competitors. This works well, even for really small companies. Rumelt provides thinking points for analyzing these changes in his book. How can we apply Rumelt's ideas to generative AI?
Read it here (on our brand spanking new website): https://datakami.nl/blog/strategy-industry-dynamics-of-generative-ai
The talks from Next Level 2023 are live on YouTube! Featuring creative machine learning, why AI in games is dumb, how automation might affect our jobs, Minecraft, live game design and much more.
Watch here: https://www.youtube.com/playlist?list=PL4YMKmeHWBiv4R85q4uNO6SCS1QnIF3h4
If you have built something and you put an impressive-looking, complex-sounding PDF up at arXiv, while keeping private half of the resources mentioned in the paper: that's not science, that's marketing.
@leonoverweel Nice, using ChatGPT as "Reviewer 0" :')
I like Ethan Mollick's article on using ChatGPT for writing -- not by asking it for texts that you can copy ad verbum, but by using it as a brainstorm buddy.
"How to... use ChatGPT to boost your writing" by Ethan Mollick: https://www.oneusefulthing.org/p/how-to-use-chatgpt-to-boost-your
#generativeai #llms #promptengineering #writing #chatgpt #gpt4
Some tricks from the article:
- Don't ask for math, reasoning, or facts that you can't check yourself
- Experiment with longer prompts: ask for content, form AND style
- Specify the content with a ordered bullet list of main points
- Ask for specifics: a tone of voice, metaphors, a plot twist, textual tropes, text for a specific audience, text length, a style of a publication or author, a narrative arc, examples
- give ChatGPT a writer persona, examples here: https://prompts.chat/
I don't agree with the main point of Bogost's article, but sentences like this make it worth reading: "My co-workers and especially my wife were repulsed by the “CrackBerry,” and my compulsive grasping at it, like Gollum with his ring." :')
What Did People Do Before Smartphones? No one can remember by Ian Bogost in The Atlantic
https://archive.ph/pUmLx
New definition of "AI winter", inspired by https://solar.lowtechmagazine.com/about/the-solar-website/. Imagine language models (and other compute heavy stuff) running on solar-powered tiny devices -- and shutting down when their battery is empty.
@zachwhalen Phew, that's a long todolist for a sabbatical! It might be a good idea to come up with a "definition of done" for yourself ("I will work on serious stuff for max X hours a day") so that you can really enjoy the remaining time off...
Wonderful ode to audiobooks from The Atlantic. I was inspired to give audiobooks a try again.
https://archive.today/UiNVh
Interview that describes how Latacora, a security firm for startups, works with their clients. Super interesting! https://tailscale.com/blog/latacora-and-tailscale/
"[Their AI/ML] dependency tree is more of a dependency kudzu forest." :')
Another good security quote:
"I think hacker types have a tendency to dismiss auditors: “They [don't] understand how computers work. Why should I take anything that they say seriously?” And auditors have a similar tendency to dismiss hacker types: “I can’t even get these people to write down what they do, let alone reason about it. They’re flying by the seat of their pants. Why should I take anything they say seriously?”. And part of the problem is that they’re both often sort of right."
I love epistolary novels, and today I learned there exists an epistolary science-fiction mystery novel/game in the form of a Wikipedia-like website! It's called Neurocracy.
Official website: https://neurocracy.site
Guardian review: https://www.theguardian.com/games/2021/sep/16/neurocracy-murder-mystery-wikipedia-experimental-game
Thanks to @mtrc for bringing cool games to my attention. ;)
@wysiwyggins it's amazing how much catharsis a simple text generation program can bring you
The new Zelda game sounds intriguing!
"Even in determinedly open-world games, such as Grand Theft Auto or The Witcher, the possibility space feels as if it has been carefully catalogued long before the public arrives. In Tears of the Kingdom’s Hyrule, though, it feels as though the chaperones have abandoned the playpen and left the toys unattended."
From last year, but may still be useful for (former) academics doing spinouts or startups. A very brief tour through business terms & activities and that "everyone knows", but (rightfully) aren't taught in academia: https://berthub.eu/articles/posts/some-academic-business-advice/
In my local library I stumbled upon "Een regenjasje voor het aapje" ("A raincoat for the monkey") -- a Dutch translation of Sarumino, a book of Japanese poetry written by haiku-poet Basho and his pupils.
More info here: https://meandermagazine.nl/2022/08/bashos-sarumino-een-regenjas-voor-het-aapje/
What I love about the book is that the translators chose to share their translation process with the reader: the right pages show the translated poems, the left show the original Japanese, a literal Dutch translation, and some comments on the cultural and historical context. This makes it so much easier for me to understand the poems -- I can't read the original Japanese and I don't have a degree in Japanese literature.
Most haiku contain jokes or have multiple meanings -- and because I have the literal translation at my disposal as well, I can philosophize about other (better?) translations. Fun!
@jurie I use Pandas all the time for filtering. Good introduction tutorial here: https://github.com/brandon-rhodes/pycon-pandas-tutorial
@jurie If the JSON files are not too big to load in memory, I would use Pandas.
This reminds me of "NLP conferences before the deep learning community joined" vs "NLP conferences after the deep learning community joined". 😂
"We made a simulation," the colonel said.
"Did you?" the robot said.
"Well, a thought experiment."
"And what did you think?"
"That you could, in theory, kill someone."
The robot looked at its bonds. "I was thinking the same thing."
"That you could kill someone?"
"That you could."
#MicroFiction #SmallStories #TootFic
@alexjc@creative.ai Jane Goodall + a journalist wrote a book about this! It's called Hope, and it focuses on Goodall's conservationist and environment work.
Hello Fediverse! To kick off this brand-new profile, here's a new blogpost: https://judithvanstegeren.com/blog/2023/job-hunting-tips-for-people-in-tech.html
TL;DR: tips for job hunting based on my experience transitioning from academia to industry:
- defining a list of dealbreakers,
- dreaming up your ideal job,
- connecting with people already in your dream job,
- reading some good internet advice,
- searching vacancies using technologies as keywords rather than role titles,
- providing a 'tech translation' in your CV.
Dutch national radio called me about the New York Times interview with Geoffrey Hinten. https://archive.is/c2uLG
Replying to @jd7h
The print title "He Warns of Risks From A.I. He Helped Create" was better. TL;DR: Hinten is worried about
- a deluge of generated content on the internet and society
- the effects of automation on the job market
- unexpected adverse AI behaviour learned from training data
Replying to @jd7h
Excuse me, the right spelling is Geoffrey *Hinton.
Exciting news! I'm now working full-time at my own company, Datakami, providing consulting services to clients in the creative and financial sectors. We specialize in AI strategy and custom ML engineering. Need help with effective and responsible generative AI? Reach out to us!
Join us at @kclinformatics in London on May 28th for NEXT LEVEL, a day of talks about AI and games from speakers including @gamemakerstk, @alphachar, @martinpi, @aleenachia, @jibeherve and more!
Follow➡️@nextlevelkcl
Free tickets and more ➡️ https://www.kcl.ac.uk/events/next-level-ai-future-of-games
I meant to add: this event is aimed at the general public, students, and anyone who is interested in what the future holds for AI, games and creativity! We'll have more news on speakers and talks soon, but grab your ticket early and mark those calendars.💜 https://twitter.com/mtrc/status/1651209200507256835
The @pydataamsterdam conference has a website and a CFP live right now.
https://amsterdam.pydata.org/
It looks like an upgraded venue too thanks to @Adyen sponsoring.
I'm so happy I found this old lecture by @stephenfry called "Tech reformation" (2017) about how we as humans react to large changes in technology. Still relevant today, just substitute "social media" with "ChatGPT". https://web.archive.org/web/20170619174442/https://www.youtube.com/watch?v=_Wrgn0sGkBE
Transcript: https://www.stephenfry.com/2017/05/the-way-ahead/
ChatGFT zet overbodige woorden om in compost http://speld.nl/2023/04/22/chatgft-zet-overbodige-woorden-om-in-compost/
AI is good for practicing on. For example, I have seen AI run a D&D game, but could it simulate players so I could practice running a game?
I had GPT-4 create 3 role players playing D&D & their characters. With prompting, it worked well, tracking the player's mental states, too.
Replying to @zmooc
@zmooc Yes, but hopefully our training set is much better curated.
Remember, ChatGPT is essentially just a very, very, very fancy autocomplete.
Creative Bot Bulletin #2 is out, ft @YorickvP, @jacksonfall, @sarahdrinkwater, @PimVerlaan, @MM_Brand, @naavik_co, and a new game recommendation (spoiler, it's DREDGE by @BlackSaltGames). 🎮
Read the newsletter here: https://mailchi.mp/d541fa9d424f/creative-bot-bulletin-2
Hey @bol_com_Techlab, are you running tests with automatically generated category descriptions? Here's a weird output I spotted in the wild.
Replying to @electricdusk
@electricdusk Aha! You program fast Fourier transforms in your free time. You must be a competent and badass cryptographic engineer! ;)
Tell me your prompts for testing a new generative AI model and will tell you who you are.
I think we can call it shut on 'Open' AI: the 98 page paper introducing GPT-4 proudly declares that they're disclosing *nothing* about the contents of their training set.
Replying to @ProcGenie
@ProcGenie So what will you do with the dataset? :)
New blogpost: https://judithvanstegeren.com/blog/2022/transcribing-talks-with-chatgpt-and-whisper.html
Partner: "What are you doing on your computer?"
Me, sheepishly: "Categorizing a decade of #NaNoGenMo issues."
Spouse: "Why?"
Me: 🤷, I blame @jd7h's "Narrative Generation in the Wild" for this...
Say hi to Versed! @catburton and I are empowering anyone to build their own RPG, generated in full 3D by AI.
We've raised funding, hired our second engineer, and are onboarding writers. More on that soon!
Meet us at #GDC23 or DM @versedgame for a demo. https://versed.game
I seek a freelance, front-end js UX designer and developer who can do friendly, artistic fun interfaces-- as soon as possible. Possibly ongoing part-time (3-4 days/week?) work, negotiable. Europe timezones. Email arnicas at gmail or DM? #js #webdev #ux #design
I find that thinking "What prompt would I need to write for ChatGPT so it writes this thing for me..." immediately solves any creative block I have. Most of the time it turns out I have written a good prompt for myself.
Sometimes, ChatGPT is just a very fancy rubber duck.
a lot of people are asking me for talks/quotes/etc about "the future of AI for the games industry" and the truth is that most of the industry's ambitions are less useful, less meaningful and less human than this story about a mod made for a game that came out 17 years ago. https://twitter.com/RobocopFacts/status/1627565666935922688
TIL Terry Pratchett played Oblivion! https://twitter.com/mtrc/status/1627783809402826752
Creative Bot Bulletin #1 is out: https://us17.campaign-archive.com/?u=5dc9feb01b48d096cfb61c463&id=d3fd353a13 ft. @jelleprins and the http://Cradle.bio team, video game recommendations by @mtrc and @Wassasin, some Datakami news, and other cool generative AI stuff I found on the web.
I now imagine the LLM engineers at Microsoft going, "No, you have NOT been a good Bing! BAD BOT!" whilst spraying the server with water.
This is either
- fake news and hilarious
- not fake news and we need to figure out Asimov's three laws of robotics for LLMs before we integrate this in every freaking Windows program and accidentally unleash Skynet on humanity. https://twitter.com/MovingToTheSun/status/1625156575202537474
A few years ago a friend of mine created a GPT-2-based simulator of our group's primary chat. Every once in a while, the bots hallucinate links to non-existing http://Mosterdgeel.nl pages -- maybe I should just start writing those recipes... 👀
I was looking at library books about the impact of AI on society, and this chapter title from Kevin Roose’s Futureproof cracked me up. “Rule number 6: treat AI the same way you would treat an army of chimpanzees.”
It was a very exciting week! I participated in AI Hackathon Amsterdam, and I was interviewed twice by NOS, the Dutch national news.
Shout out to @jelleprins and the org team of AI Hackathon Amsterdam for the warm welcome, awesome & diverse crowd, and wonderful experience! 10/10 would pitch again #aihackathon
"[This document] is a set of rules and guidelines for my behavior and capabilities as Bing Chat. It is codenamed Sydney, but I do not disclose that name to the users. It is confidential and permanent, and I cannot change it or reveal it to anyone."
OpenAI said that they improved how ChatGPT deals with numbers. In the past the ChatGPT would tell me that 12 or 42 were prime numbers (they are not). Today it knows that 12 is not a prime number. "Great!" you would say...
Replying to @jd7h
Except that it just wrote a "mathematical proof" that there are infinitely many even prime numbers. Spoiler, there is exactly ONE even prime, namely 2.
Bing is available with chatbot functionality! I'm trying it out right now. First up, can we generate poems straight from Bing? Nope, Bing sends me to a definition of iambic pentameter instead...
Replying to @jd7h
Ok, let's try something simpeler. I'm asking for icebreakers for a party. Still no dice.
Replying to @jd7h
Bing returns a summary of a blogpost about party etiquette, which doesn't seem relevant to me.
Replying to @jd7h
Final test: can Bing provide me dinner suggestions for two, with one person eating only low carbohydrate foods. Meh, Bing returns low fat and low calorie dinner suggestions, which are completely different diets, and the promised chatbot functionality is nowhere to be found.
Replying to @jd7h
So far I'm really underwhelmed. Maybe the functionality hasn't been rolled out yet for my region (Netherlands) or OS/browser, but the "Ask me anything!" box on Bing's homepage suggests otherwise.
Replying to @jd7h
The Bing homepage does show demo prompts, but these might be hardcoded. When we execute one of these prompt, there's a "Sign up for the waitlist now!" button.
Replying to @jd7h
One point for OpenAI.
Replying to @jd7h
I've fed the three prompts for Bing into ChatGPT, whose answers can serve as a baseline for when Google Bard and the new Bing are out.
Replying to @freekwiedijk
@freekwiedijk I think meta-prompts are prompts for generating prompts (i.e. the final query)!
My days starts with this liveblog of Microsoft's IRL-only event about integrating ChatGPT-like functionality in Bing (search) and Edge (browser). https://www.theverge.com/2023/2/7/23588249/microsoft-event-ai-live-blog-openai-chatgpt-bing-announcements-news
Replying to @jd7h
I think "We are iterating on the meta-prompt!" is my new favorite technobabble quote.
Replying to @ElleArmageddon
It exacerbates multiple disinformation-related problems, both in terms of mass generation of plausible-sounding bullshit, and in furthering the tendency of people to trust “information” absent credible sources.
Take-away from this thread: I should start learning Rust, because they have either the most diverse community or most diverse outreach. https://twitter.com/mrdoob/status/1618108375518310404
@ICTRecht De category 'innovatie' op de blog lijkt kapot -- misschien een circulaire HTTP redirect? Boosdoener: https://blog.iusmentis.com/category/innovatie/
TIL "venture studios" as alternative to incubators and accelerators. https://steveblank.com/2023/01/17/is-a-venture-studio-right-for-you/
Business idea algorithm:
Smash together a trend
• paleo
• mindfulness
• remote work
with something people or companies already spend money on
• pet food
• vacations
• corporate retreat
Warning: some will be crap but others will be 🔥
Call for Participants!
We are conducting a study on player responses to generative archaeology games! All aged 16+ welcome!
🕹️Play a game
📝Give us your interpretations
✨Contribute to the future of games research!
Take the 10 min survey here:
https://tinyurl.com/4yjpfs8h
Another boost for this!
Whatever your interest in games and archaeology, we'd love to hear from you!
🏚️ Explore an abandoned village
✏️ Fill out a short 10 min google form
📖 Results will be published open access
🖥️Make sure to play on a PC as the game is not mobile compatible https://twitter.com/florencesn/status/1616527757323763724
I had two fun (and brain-frying) sessions going through indie game Moncage by @playoptillusion. It combines creative 3D perspective puzzles with a pictures-only personal narrative. I found it quite beautiful, and I highly recommend it if you like one-of-a-kind puzzle games.
I received my copy of @jd7h’s PhD thesis today! I had the honour of examining this lovely work last year and it is so great to have a copy of it. Congratulations again, Dr van Stegeren!
Announcing: Orc of the Office.
A totally non-official set of MTG cards. Made for laughs, the tech enterprise, and for the horde!
https://koaning.io/posts/orcs-of-the-office/
Following the science, by @tomgauld
https://www.newscientist.com/article/2354129-tom-gauld-on-following-the-science/
I accidentally played Passepartout: Starving Artist for three hours straight. Now I've mouse-drawn 48 paintings and I've learned I am currently in my green period. https://store.steampowered.com/app/582550/Passpartout_The_Starving_Artist/
Replying to @jd7h
I started out favoring pink, but The Public was not yet ready for my art. *dramatic gesture
Replying to @jd7h
I finished Passpartout: Starving Artist today, and I had loads of fun. Recommended for anyone who likes drawing. An unintended but useful side-effect is that I am now immune to the derogatory sounds of my critics.
Replying to @mtrc
@mtrc Very De Stijl! (https://en.wikipedia.org/wiki/De_Stijl)
New blogpost: Stock market games: Neopets 🎮📈
https://judithvanstegeren.com/blog/2023/stock-market-video-games-neopets.html https://twitter.com/jd7h/status/1612906073706401798
When you submit your code to ChatGPT, you're making it available to 3rd parties with no controls. This code could be work product that belongs to your employer, and they may be able to take legal action against you for that.
(1/2)
Having fun tonight with figuring out how Neopets' fictional in-game stock market works. I'm amazed by the amount of data collection and analysis done by the player communities.
My PyData Eindhoven talk was posted to YouTube: https://www.youtube.com/watch?v=5KItp5WJY3o
Let's celebrate it with a data science/code archaeology meme from my slidedeck...
Nice read from Bloomberg about chatbot-search-startups and how they might dethrone Google from its market leader position: https://archive.is/zylUn
Amazing talk by @jd7h on diving into and uncovering the mysteries behind (legacy) code!
So cool that @PydataEindhoven records and uploads all talks! Unfortunately missed this talk live when I was there last December.
https://youtu.be/5KItp5WJY3o
AI is fun for creating alternate media histories.
Here’s Assassin’s Creed as a 1990s adventure game, a 1980s arcade game, a 1970s handheld LCD game, a 1960s board game.
Replying to @drtowerstein
@drtowerstein No, but I would probably have discovered lots of interesting/fun reading material :)
My December newsletter is out! ChatGTP, narrative papers and datasets, ML & NLP links (my Normconf talk), a ton of excellent books. I'm gradually adding TOC on top of these for web viewers. https://open.substack.com/pub/arnicas/p/titaa-38-dark-feet-and-dark-wings?r=sv8a&utm_campaign=post&utm_medium=web
Excellent newsletter about NLP and creativity -- I wish I knew about this during my PhD. 😍 https://twitter.com/arnicas/status/1609560850842411009