Feed

Page 16 of 17

Erik Craddock
Erik Craddock@eriklink

Model Context Protocol has prompt injection security problems

As more people start hacking around with implementations of MCP (the Model Context Protocol, a new standard for making tools available to LLM-powered systems) the security implications of tools built on that protocol are starting to come into focus.

Model Context Protocol has prompt injection security problems

Simon Willison’s Weblog

Model Context Protocol has prompt injection security problems

As more people start hacking around with implementations of MCP (the Model Context Protocol, a new standard for making tools available to LLM-powered systems) the security implications of tools built …

linkby Simon Willisonvia Simon Willison’s Weblog
0 Replies0 Boosts0 Likes
Erik Craddock
Erik Craddock@eriklink

Import AI 406: AI-driven software explosion; robot hands are still bad; better LLMs via pdb | Import AI

Researchers with Forethought, an AI research organization, think it’s likely that modern AI research will yield AI systems capable of building their successors. Forethought expects that at some point in the future it’ll be possible to build AI Systems for AI R&D Automation (ASARA).

Why this matters – LLMs are more powerful than we think, they just need the right tools: Systems like this are yet another example of the ‘capability overhang’ which surrounds us – you can make LLMs better merely by pairing them with the right tools and, these days, you don’t need to do any adaption of the LLMs for those tools beyond some basic prompting. Put another way: if you paused all AI progress today, systems would continue to advance in capability for a while solely through the creation of better tools.
Read more: debug-gym: A Text-Based Environment for Interactive Debugging (arXiv).
Get the software here: debug-gym (Microsoft site).

Import AI 406: AI-driven software explosion; robot hands are still bad; better LLMs via pdb

Import AI

Import AI 406: AI-driven software explosion; robot hands are still bad; better LLMs via pdb

Welcome to Import AI, a newsletter about AI research. Import AI runs on lattes, ramen, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now It seems likely that…

linkby Jack Clarkvia Import AI
0 Replies0 Boosts0 Likes
Erik Craddock
Erik Craddock@eriklink

MCP: The new USB-C for AI that’s bringing fierce rivals together

MCP has also rapidly begun to gain community support in recent months. For example, just browsing this list of over 300 open source servers shared on GitHub reveals growing interest in standardizing AI-to-tool connections. The collection spans diverse domains, including database connectors like PostgreSQL, MySQL, and vector databases; development tools that integrate with Git repositories and code editors; file system access for various storage platforms; knowledge retrieval systems for documents and websites; and specialized tools for finance, health care, and creative applications.

To make the connections behind the scenes between AI models and data sources, MCP uses a client-server model. An AI model (or its host application) acts as an MCP client that connects to one or more MCP servers. Each server provides access to a specific resource or capability, such as a database, search engine, or file system. When the AI needs information beyond its training data, it sends a request to the appropriate server, which performs the action and returns the result.

MCP: The new “USB-C for AI” that’s bringing fierce rivals together

Ars Technica

MCP: The new “USB-C for AI” that’s bringing fierce rivals together

Model context protocol standardizes how AI uses data sources, supported by OpenAI and Anthropic.

linkby Benj Edwardsvia Ars Technica
0 Replies0 Boosts0 Likes
Erik Craddock
Erik Craddock@eriklink

No elephants: Breakthroughs in image generation

Over the past two weeks, first Google and then OpenAI rolled out their multimodal image generation abilities. This is a big deal. Previously, when a Large Language Model AI generated an image, it wasn’t really the LLM doing the work. Instead, the AI would send a text prompt to a separate image generation tool and show you what came back. The AI creates the text prompt, but another, less intelligent system creates the image. For example, if prompted “show me a room with no elephants in it, make sure to annotate the image to show me why there are no possible elephants” the less intelligent image generation system would see the word elephant multiple times and add them to the picture. As a result, AI image generations were pretty mediocre with distorted text and random elements; sometimes fun, but rarely useful.

No elephants: Breakthroughs in image generation

oneusefulthing.org

No elephants: Breakthroughs in image generation

When Language Models Learn to See and Create

linkby Ethan Mollickvia One Useful Thing
0 Replies0 Boosts0 Likes
Erik Craddock
Erik Craddock@eriklink

Import AI 405: What if the timelines are correct? | Import AI

This article is full of all sorts of interesting information from questions about LLM consciousness to security threats to LLM agents with the capability of doing months of work.

The paper is worth reading because it represents an earnest attempt by a thoughtful human to confront the impossibly large question we’ll need to deal with in the next decade or so – how conscious might LLMs be?

Individuals working with AI performed just as well as teams without AI, showing a 0.37 standard deviation improvement over the baseline. This suggests that AI effectively replicated the performance benefits of having a human teammate – one person with AI could match what previously required two-person collaboration.

“By automating complex tasks previously requiring human ingenuity and extensive effort, AI models can significantly lower the barriers to entry for malicious actors of all attack levels,” Google writes. “”Our evaluations revealed that current AI cyber evaluations often overlook critical areas. While much attention is given to AI-enabled vulnerability exploitation and novel exploit development, our analysis highlights AI’s significant potential in under-researched phases like evasion, detection avoidance, obfuscation, and persistence. Specifically, AI’s ability to enhance these stages presents a substantial, yet often underestimated, threat.”

Significant and sustained growth: “We find that the 50% time horizon has been growing exponentially from 2019–2024 on our tasks,” METR writes. The analysis means METR thinks there’s a high chance AI systems will be able to tackle tasks that take a human a month (167 working hours) by 2030 – or potentially earlier, if a recent uptick in the trajectory due to the arrival of new reasoning models holds.

Import AI 405: What if the timelines are correct?

Import AI

Import AI 405: What if the timelines are correct?

Welcome to Import AI, a newsletter about AI research. Import AI runs on lattes, ramen, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now Import A-Idea:What i…

linkby Jack Clarkvia Import AI
0 Replies0 Boosts0 Likes
Erik Craddock
Erik Craddock@eriklink

Revenge of the junior developer | Sourcegraph Blog

I believe the AI-refusers regrettably have a lot invested in the status quo, which they think, with grievous mistakenness, equates to job security. They all tell themselves that the AI has yet to prove that it’s better than they are at performing X, Y, or Z, and therefore, it’s not ready yet.

But from where I’m sitting, they’re the ones who aren’t ready. I lay this all out in detail, my friends, so you can help yourselves.

Regardless of why the luddites aren’t adopting it, they have lost. Junior devs have the high ground, and the battle is now over. Not only are junior devs on average adopting AI faster, but junior devs are also – surprise! – cheaper. If companies are going to make cuts to pay for their devs to win with tokens, which devs do you think they’re gonna keep?

Revenge of the junior developer

Sourcegraph

Revenge of the junior developer

The latest installment from Steve Yegge on viiiiibe coding and what that means for developer jobs.

linkvia Sourcegraph
0 Replies0 Boosts0 Likes
Erik Craddock
Erik Craddock@eriklink

Not all AI-assisted programming is vibe coding (but vibe coding rocks)

I’m concerned that the definition is already escaping its original intent. I’m seeing people apply the term “vibe coding” to all forms of code written with the assistance of AI. I think that both dilutes the term and gives a false impression of what’s possible with responsible  AI-assisted programming.

Vibe coding is not the same thing as writing code with the help of LLMs!

Simon Willison’s Weblog

Not all AI-assisted programming is vibe coding (but vibe coding rocks)

Vibe coding is having a moment. The term was coined by Andrej Karpathy just a few weeks ago (on February 6th) and has since been featured in the New York …

linkby Simon Willisonvia Simon Willison’s Weblog
0 Replies0 Boosts0 Likes
Erik Craddock
Erik Craddock@eriklink

Kagi is a better search engine than Google — but it costs $10 a month | The Verge

Using Kagi feels a lot like using Google a decade ago, and I mean that in a good way. You type in a search, and it returns a page full of links. It has image search, video search, maps, news, and even a podcast-specific tab I’ve found very useful. Search for something topical, and you’ll get a few links followed by a side-scrolling carousel of news stories. Search for a person, and Kagi virtually always starts with a short excerpt of their Wikipedia page.

The future of search isn’t Google — and it’s $10 a month

The Verge

The future of search isn’t Google — and it’s $10 a month

Paying for a search engine is a lot to ask, but you might not have a better option anymore.

linkby David Piercevia The Verge
0 Replies0 Boosts0 Likes
Erik Craddock
Erik Craddock@eriklink

Import AI 404: Scaling laws for distributed training; misalignment predictions made real; and Alibaba’s good translation model | Import AI

We are not making dumb tools here – we are training synthetic minds. These synthetic minds have economic value which grows in proportion to their intelligence. The ‘reward system’ of the world is flowing resources into the building of smarter synthetic minds. As we make these things smarter, they will more and more display a propensity to think about themselves as distinct from us.

**Really powerful AI could wreck society by making governments too powerful:
**_…The problem with AGI is that it could make governments way better, which destroys freedom…
_Researchers with Texas A&M University and the Foundation for American Innovation have considered how powerful AI systems could alter the balance of power between citizens and government. Their takeaway isn’t very reassuring – powerful AI systems are highly likely to either a) create a “‘despotic Leviathan’ through enhanced state surveillance and control”, or foster an “‘absent Leviathan’ through the erosion of state legitimacy relative to AGI-empowered non-state actors”.

Import AI 404: Scaling laws for distributed training; misalignment predictions made real; and Alibaba’s good translation model

Import AI

Import AI 404: Scaling laws for distributed training; misalignment predictions made real; and Alibaba’s good translation model

Welcome to Import AI, a newsletter about AI research. Import AI runs on lattes, ramen, and feedback from readers. If you’d like to support this, please subscribe. Subscribe now A whole bunch of 202…

linkby Jack Clarkvia Import AI
0 Replies0 Boosts0 Likes

Page 16 of 17