https://mathstodon.xyz/@tao/115855840223258103 “Erdos problem #728 was solved more or less autonomously by AI”
Recently, the application of AI tools to Erdos problems passed a milestone: an Erdos problem (#728, https://www.erdosproblems.com/728 ) was solved more or less autonomously by AI (after some feedback from an initial attempt), in the spirit of the problem (as reconstructed by the Erdos problem website community), with the result (to the best of our knowledge) not replicated in existing literature (although similar results proven by similar methods were located).
This is a demonstration of the genuine increase in capability of these tools in recent months, and is largely consistent with other recent demonstrations of AI using existing methods to resolve Erdos problems, although in most previous cases a solution to these problems was later located in the literature, as discussed in https://mathstodon.xyz/deck/@tao/115788262274999408 . This particular case was unusual in that the problem as stated by Erdos was misformulated, with a reconstruction of the problem in the intended spirit only obtained in the last few months, which helps explain the lack of prior literature on the problem. However, I would like to talk here about another aspect of the story which I find more interesting than the solution itself, which is the emerging AI-powered capability to rapidly write and rewrite expositions of the solution. (1/5)
Let me begin by quickly recapping the history of this problem. In 1975, Erdos, Graham, Ruzsa, and Straus studied the prime factorization of binomial coefficients such as $\binom{2n}{n}$, and posed many related questions. One such question (which was an offhand spinoff of a different question) asked whether one could find infinitely many 𝑎,𝑏,𝑛 with 𝑎,𝑏≥ε𝑛 obeying the divisibility condition 𝑎!𝑏!|𝑛!(𝑎+𝑏−𝑛)! and 𝑎+𝑏>𝑛+𝐶log𝑛. However, the question was vaguely worded in a number of respects; for instance, it was initially unclear whether 𝐶 was intended to be small or large.
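As an aside, the divisibility condition is easy to experiment with numerically. The following short Python sketch is my own illustration rather than anything from the discussion, and the parameter values ε = 1/4 and 𝐶 = 1 are arbitrary demo choices, not values from the problem; it enumerates small triples satisfying the original constraints:

```python
from math import factorial, log

def divides(a: int, b: int, n: int) -> bool:
    """Check the divisibility condition a! b! | n! (a+b-n)!."""
    if a + b < n:
        return False  # (a+b-n)! would have a negative argument
    return (factorial(n) * factorial(a + b - n)) % (factorial(a) * factorial(b)) == 0

eps, C = 0.25, 1.0  # arbitrary demo values, not from the problem
for n in range(10, 31):
    for a in range(1, n + 1):
        for b in range(a, n + 1):  # take b >= a, since the condition is symmetric in a, b
            if a >= eps * n and b >= eps * n and a + b > n + C * log(n) and divides(a, b, n):
                print(f"n={n}, a={a}, b={b}")
```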
A few months ago, a team associated with the AI tool AlphaProof observed that the problem admitted several trivial solutions, if 𝑎 or 𝑏 were allowed to be large compared with 𝑛. This technically solved the problem, but was deemed not in the spirit of the question, and an additional constraint 𝑎,𝑏≤(1−ε)𝑛 was imposed to rule out these solutions. Further AI-assisted literature search did not turn up significant work on this problem. (2/5)
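One such trivial family (my example, to illustrate the phenomenon): taking 𝑎=𝑛 makes the two sides of the divisibility identical, since then 𝑎!𝑏!=𝑛!(𝑎+𝑏−𝑛)! exactly, so every sufficiently large 𝑏 gives a solution; the added constraint 𝑎,𝑏≤(1−ε)𝑛 excludes precisely such degenerate choices. A quick check using the divides helper from the sketch above:

```python
# Trivial family: a = n gives a! b! = n! (a+b-n)! exactly, so the
# divisibility holds for every b, and a + b > n + C log n once b is large.
# The added constraint a, b <= (1-eps) n rules these solutions out.
assert all(divides(n, b, n) for n in range(1, 30) for b in range(1, 30))
```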
On Jan 4, ChatGPT was able to produce a proof even with the adjusted constraint, but with 𝐶 taken to be a small constant: https://chatgpt.com/s/t_695bdbf3047c8191af842d03db356b1a ; this was then formalized in Lean by Aristotle. However, it was determined on closer reading of the source paper that 𝐶 was intended to be large; indeed, the results in the original paper already established the small-𝐶 claim, although this was not evident to us until a few days later.
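For readers unfamiliar with Lean, a formalized statement of the small-𝐶 claim might look something like the following sketch; the theorem name and exact phrasing here are invented for illustration and are not taken from the actual Aristotle formalization:

```lean
import Mathlib

/-- Hypothetical phrasing of the small-C claim (illustrative only; not the
actual Aristotle output): for some small C > 0, there are infinitely many
triples (a, b, n) with εn ≤ a, b ≤ (1-ε)n, a! b! ∣ n! (a+b-n)!, and
a + b > n + C log n. -/
theorem erdos728_small_C (ε : ℝ) (hε : 0 < ε) (hε' : ε < 1 / 2) :
    ∃ C : ℝ, 0 < C ∧ ∀ N : ℕ, ∃ a b n : ℕ, N < n ∧
      ε * (n : ℝ) ≤ (a : ℝ) ∧ ε * (n : ℝ) ≤ (b : ℝ) ∧
      (a : ℝ) ≤ (1 - ε) * (n : ℝ) ∧ (b : ℝ) ≤ (1 - ε) * (n : ℝ) ∧
      a.factorial * b.factorial ∣ n.factorial * (a + b - n).factorial ∧
      (n : ℝ) + C * Real.log n < (a : ℝ) + (b : ℝ) := by
  sorry
```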
Shortly afterwards, a different website participant ran the Lean proof through ChatGPT to rewrite it in natural language ( https://drive.google.com/file/d/1ejHqEddpD52SYubZlK8eYx2aoanFmMs8/view?usp=sharing ), and then after further conversation obtained an improved writeup ( https://drive.google.com/file/d/1HAoRuYiUTN0PNOhjY96-dWKNqJ1X4Ucy/view?usp=sharing ), in which several gaps in the original proof were filled in. The exposition was still somewhat clunky and "AI" in feel, and lacked many of the remarks and references to the literature that would organically accompany a human-written proof, but it was readable enough that the general ideas of the proof could be readily extracted. (3/5)
Meanwhile, with further prompting, ChatGPT was also able to adapt the argument to handle large 𝐶 as well as small 𝐶, thus finally producing a new result in the spirit of the intended question: https://drive.google.com/file/d/1xRw8_o2C8HwmxMDnBR5OJlxXaW7jlYbz/view?usp=sharing . Interestingly, the proof contained some minor errors, but the AI tool Aristotle was able to automatically repair these gaps and produce a Lean-verified proof.
At this point, a third participant ran Aristotle again on the existing Lean proof to produce a shorter version, which a different participant then fed into a lengthy back-and-forth ChatGPT session ( https://chatgpt.com/share/695e7cbd-605c-8010-809b-ccba75560c76 ) to turn it into a much more fully fleshed-out article with a tighter narrative structure, one that described not just the proof itself but also its connections to the prior literature. This resulted in a new writeup of the proof ( https://drive.google.com/file/d/1MRQfcHhrYMfMTvlZcMC3zEK7aOrUyHiQ/view?usp=sharing ) that had less of the feel of a generic AI-produced document, and which I judge to be at a level of writing within the ballpark of an acceptable standard for a research paper, although there is still room for further improvement. (I review this text at https://www.erdosproblems.com/forum/thread/728#post-2852 .) (4/5)
My preference would still be for the final writeup of this result to be primarily human-generated in the most essential portions of the paper, though I can see a case for delegating routine proofs to some combination of AI-generated text and Lean code. But to me, the more interesting capability revealed by these events is the ability to rapidly write and rewrite new versions of a text as needed, even if one was not the original author of the argument.
This is in sharp contrast to existing practice, where producing even one readable manuscript is quite time-consuming, and subsequent revisions (in response to referee reports, for instance) are largely confined to local changes (e.g., modifying the proof of a single lemma), with large-scale reworking of the paper often avoided due both to the work required and the significant risk of introducing new errors. However, the combination of reasonably competent AI text generation and modification capabilities, paired with the ability of formal proof assistants to verify the informal arguments thus generated, allows for a much more dynamic and high-multiplicity conception of what a writeup of an argument is, with the ability for individual participants to rapidly create tailored expositions of the argument at whatever level of rigor and precision is desired.
Presumably one would still want to have a singular "official" paper artefact that is held to the highest standards of writing; but this primary paper could now be accompanied by a large number of secondary alternate versions of the paper that may be somewhat looser and AI-generated in nature, but could hold additional value beyond the primary document. (5/5)