Tuesday, February 4, 2025

The Download: Understanding dark matter, and AI jailbreak protection


We can put a good figure on how much we know about the universe: 5%. That’s how much of what’s floating about in the cosmos is ordinary matter: planets and stars and galaxies and the dust and gas between them. The other 95% is dark matter and dark energy, two mysterious entities aptly named for our inability to shed light on their true nature.

Earlier work has begun pulling aside these dueling forces, however darkish matter and darkish power stay shrouded in a blanket of questions—critically, what precisely are they?

Enter the Vera C. Rubin Observatory, one of our 10 breakthrough technologies for 2025. Boasting the largest digital camera ever created, Rubin is expected to study the cosmos in the highest resolution yet once it begins observations later this year. And with a better window on the cosmic battle between dark matter and dark energy, Rubin might narrow down existing theories on what they’re made of. Here’s a look at how.

—Jenna Ahart

This story is part of MIT Technology Review Explains, our series untangling the complex, messy world of technology to help you understand what’s coming next. You can read more from the series here.

Anthropic has a new way to protect large language models against jailbreaks

What’s new? AI firm Anthropic has developed a new line of defense against a common kind of attack called a jailbreak. A jailbreak tricks large language models (LLMs) into doing something they’ve been trained not to do, such as helping somebody create a weapon. And Anthropic’s new approach could be the strongest shield against the attacks yet.

How they did it: Jailbreaks are a kind of adversarial attack: input passed to a model that makes it produce an unexpected output. Despite a decade of research, there’s still no way to build a model that isn’t vulnerable. But instead of trying to fix its models, Anthropic has developed a barrier that stops attempted jailbreaks from getting through and unwanted responses from the model getting out. Read the full story.
