Hacker News

Claude Science

561 points by lebovic

by lebovic

8 subcomments

I built one of the connected tools included in this launch (the Biomni HPC [1]), and I have spent an inordinate amount of my life working on this problem. (I also worked at Anthropic, but not on this product.)
As other comments have pointed out, this is for data science – but it's capable of more than making plots and writing papers [2]. It has integrations with many databases and computational tools, including a researcher's institutional cluster.
That alone is valuable. I founded a startup after struggling with this problem at a bio startup; integrating these tools and databases is hard and time-consuming. If the only outcome of this product is that great APIs are built for LLMs, it will have a massive positive impact. Many databases used in computational genomics are still only accessible through FTP!
LLMs are particularly good at navigating these tools and databases. It's often very specialized, but straightforward, work that benefits from in-context skills. Seeing an early glimpse of my former customers – bioinformaticians – using LLMs to solve this problem is what led me to join Anthropic in 2024.
Also, this pattern isn't fundamentally constrained to data science: you can also integrate with a wet lab or a CRO for some kinds of science. This is what I'm spending my time on now.
This type of science doesn't solve everything, but it's useful in some niches. For example, progress on many rare diseases is bottlenecked by researcher attention rather than a fundamental breakthrough.
[1] https://x.com/phylo_bio/article/2029233694775624096
[2] In comparison, OpenAI's science product – Prism – was effectively a LaTeX editor they acquired with Crixet.

by gjuggler

2 subcomments

The most interesting thing here is that Claude Science runs a local server and a web-based UI that connects to that server from your browser. This is very different from Claude Code and Cowork, where the UI is more tightly coupled to the host machine (which makes things like computer use possible).
I think I recognize the strategy: most pharma environments connected to interesting data are tightly locked down, to the point where you can't just connect your MacBook to the source data.
Similarly, access to large genomic biobank datasets like UK Biobank or NIH's All of Us program is granted only through a Trusted Research Environment (TRE), a remote data analysis platform usually quite restricted on internet access, etc. You can't easily run desktop apps, but these environments do usually support running JupyterLab or VS Code, tunneling the user interface through to the end user. (Source: I previously ran the team that built the All of Us TRE.)
Claude Science looks a lot more like something one could imagine spinning up in one of those highly-constrained data environments (with the "server" running within the TRE and the UI proxied to the end user's browser) than the does-everything Claude mega-app. That will be critical for traction within pharma R&D environments.
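A minimal sketch of the pattern described above: a server process bound to loopback inside the restricted environment, with the UI reached over HTTP through a tunnel or proxy. It uses only the Python standard library. The handler, page content, and function names are hypothetical, and nothing here is taken from how Claude Science is actually implemented.

```python
import threading
import urllib.request
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer


class UIHandler(BaseHTTPRequestHandler):
    """Serves a placeholder page standing in for the web UI (hypothetical)."""

    def do_GET(self):
        body = b"<html><body>workbench UI placeholder</body></html>"
        self.send_response(200)
        self.send_header("Content-Type", "text/html; charset=utf-8")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # silence per-request logging


def start_local_ui(port=0):
    """Bind to loopback only, so nothing is exposed outside the host.

    port=0 asks the OS for any free port.
    """
    server = ThreadingHTTPServer(("127.0.0.1", port), UIHandler)
    threading.Thread(target=server.serve_forever, daemon=True).start()
    return server


def fetch_ui(server):
    """Fetch the page the way a proxied browser would, bypassing env proxies."""
    host, port = server.server_address
    opener = urllib.request.build_opener(urllib.request.ProxyHandler({}))
    with opener.open(f"http://{host}:{port}/", timeout=5) as resp:
        return resp.status, resp.read()
```

Because the server only listens on 127.0.0.1, the end user's browser reaches it through whatever forwarding the environment allows, for example SSH local port forwarding (`ssh -L`) where that is permitted, or the same web proxy that already fronts JupyterLab in the TRE.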
I will say that for moderately-computational scientists, who are daily driving RStudio, JupyterLab, or maybe VS Code, Claude Science will be quite an unfamiliarly shaped product. I'll be curious to see whether something like this gains adoption (1) in place of, (2) alongside, or (3) eventually wrapping around the more traditional data science workbench tools out there.

by packeted

3 subcomments

I watched the announcement and gave it a spin, as I'm a heavy user of Cowork/Code. So far I'm super impressed. I used it to analyze my whole-genome sequencing data, which I have because my son has a rare genetic condition. In about a minute it answered a question I'd asked a few bioinformaticians to help me with but never got a satisfactory answer to: whether his n-of-1 de novo, heterozygous single-nucleotide mutation was likely passed down from mom or dad. It performed a read-backed phasing analysis on the data and identified a nearby SNP with overlapping coverage where mom was homozygous and dad was heterozygous. It then identified my variant on his mutated allele, so it looks like it came from me.
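A toy sketch of the read-backed phasing logic described above, for readers unfamiliar with it. The read tuples, allele letters, thresholds, and function names are all hypothetical; a real analysis would extract reads spanning both sites from aligned BAM/CRAM files and account for sequencing error and mapping quality.

```python
from collections import Counter


def assign_snp_alleles(mom_gt, dad_gt, child_gt):
    """Map each of the child's two SNP alleles to the parent it must come from.

    Returns None when the SNP is uninformative (child homozygous, or both
    transmission patterns are possible).
    """
    mom, dad, child = set(mom_gt), set(dad_gt), set(child_gt)
    if len(child) != 2:
        return None
    a, b = sorted(child)
    options = []
    if a in mom and b in dad:
        options.append({a: "mother", b: "father"})
    if b in mom and a in dad:
        options.append({b: "mother", a: "father"})
    return options[0] if len(options) == 1 else None


def parent_of_origin(reads, mom_gt, dad_gt, child_gt, min_reads=3, min_fraction=0.9):
    """reads: (variant_allele, snp_allele) pairs from reads covering both sites,
    where variant_allele is "alt" (carries the de novo variant) or "ref"."""
    origin = assign_snp_alleles(mom_gt, dad_gt, child_gt)
    if origin is None:
        return "uninformative"
    # Each read carrying the variant votes for the parent whose SNP allele it shows.
    votes = Counter(
        origin[snp] for variant, snp in reads if variant == "alt" and snp in origin
    )
    total = sum(votes.values())
    if total < min_reads:
        return "insufficient reads"
    parent, count = votes.most_common(1)[0]
    return parent if count / total >= min_fraction else "ambiguous"


# Mom G/G, dad G/T, child G/T: the child's T can only have come from dad.
example_reads = [("alt", "T")] * 5 + [("ref", "G")] * 6
print(parent_of_origin(example_reads, "GG", "GT", "GT"))  # father
```

The key step is that the nearby SNP is only useful when exactly one transmission pattern is consistent with the parents' genotypes; if both parents were G/T, the sketch returns "uninformative" rather than guessing.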
It also cross-checked my data against ACMG Secondary Findings genes and ClinVar likely pathogenic/pathogenic variants and came back with identical results to my Natera Horizon carrier screening results.
I'd previously tried and failed to do this all with some ChatGPT guidance and subsequently hired a couple of bioinformatician post-docs at top tier universities via Upwork who had failed to give me satisfactory results.
And this is just getting started!

by teekert

1 subcomments

I'm a scientist (biophysicist). Over time I have become a bioinformatician and a Python dev.
I wrote articles and applications, and it always was a struggle. But now I can speed up, make it all go much faster. But I often feel like my mental models can't keep up.
Recently the AI has generated a comprehensive data model (in Django) and I find myself retracing its steps with long discussions and explanations (with/from the LLM) and searching for documentation. With scientific assignments I find myself searching literature on my own, read whole papers as I used to. Checking the LLM constantly but adapting to it and I don't like it, don't like how it steers me, just let me search, let me wander the scientific landscape on my own, let me read the words of the authors with opposing views. Then let me make 20 plots and only use 1, let me wrestle with the data. Let me make wrong visuals that by chance communicate something important about the data.
Because otherwise I feel uncomfortable, I need to understand, that is what I do. I can reason about so many things because my internal world model is comprehensive and mostly correct. That has taken 44 years so far. Hard work from time to time, but I've mostly enjoyed it.
I still don't know what to make of these models. I use them every day, but sometimes I wonder if I was not just as fast with Stack Overflow, because what I crave is understanding, not "some finished app". Yes, I rarely finish things fully (that's how I feel), but in research I've often been told they like my ability to move very fast and creatively in phase one; the development is left to others anyway...
I crave an understanding of what these tools mean to me exactly. This comment is part of that. HN is part of that.

by gravelc

2 subcomments

Tried this to see how it goes in my particular field - computational design of RNAi-based biopesticides. One-shotted a design for targeting the DvSnf7 transcript of western corn rootworm. It took a fairly naive approach (maybe how a 1st year PhD student would go about it), but got the job done. Also noted caveats with its approach (e.g. using mammalian design rules, limited off-target screening). Not bad really. But also not great. When its flaws were pointed out, the AI determined that it could have taken a more informed approach. Then Opus 4.8's safety system flagged the session.

by minimaxir

4 subcomments

When I saw "Science" I didn't think they meant Data Science, which is what the UIs full of pandas code and plots imply. Even if the focus is on the sciences, I suspect that's the less valuable part of the announcement particularly with the implication of Jupyter Notebook 2.0.
Image-understanding for data viz is a use case that has been ignored, and modern LLMs are getting better at proper EDA. But, uh, I may need to update my resume.

by PotatoFarmsKing

2 subcomments

Before LLMs the tech groups I followed were rife with discussions about this and that topic, what to use and when; I believe these discussions sparked the creation of many frameworks and tools out of "this seems like a good idea, wouldn't hurt to implement it". Unfortunately it all revolves around LLMs nowadays and how to make some LLM work some way or another; we don't even discuss the very topics the groups were created to discuss. I fear science is soon to taste the same thing - discussions about LLMs taking place instead of the actual topics that would be discussed otherwise.

by Recursing

1 subcomments

This seems to have unblocked Claude Desktop for Linux ( https://code.claude.com/docs/en/desktop-linux )

by celltalk

6 subcomments

I basically did the same thing almost one and a half years ago and not many people cared, but I still believe that this is the future for computational biology.
https://celvox.co/solutions/axon

by qwerty_clicks

1 subcomments

Should be called Claude-bio-big-bucks.
What about earth science, physics, engineering? The connectors and skills are all just biology and pharma. Boo

by Sol-

1 subcomments

So it's like Claude Cowork for Science, i.e. for less tech-savvy users? I would imagine scientists with some coding background might just prefer to use Claude Code normally and integrate it with their stack of choice, but perhaps the comfort and ease of use of Claude Science still wins out.

by evolighting

0 subcomment

Around the time I graduated from the research institute in 2020, it seems my lab already had a similar infrastructure, just without LLMs and agents.
Back then, we had data repositories, databases, Jupyter Notebooks, Slurm batches, open computing platforms, and so on. It could do similar things, just by hand.
While adding an LLM agent can indeed drastically improve usability, it must be a massive headache for system administrators. It honestly sounds like introducing a huge, uncontrollable wildcard into the system.

by dbcooper

0 subcomment

A "standing review agent" seems to be one of the main differences beyond the new connectors and in place visualisation tools.
>A standing reviewer agent. This runs in the background during a session, checking citations against sources, flagging numbers it can't trace back to evidence, and catching figures that don't match the code that supposedly generated them. That's not something Code or Cowork do automatically — you'd have to ask Claude to double-check itself as a separate step.
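As a toy illustration of one check named in the quote (flagging numbers that can't be traced back to evidence), here is a minimal sketch. It is not Anthropic's implementation; the function name, the regex, and the naive value-matching rule are all assumptions made for the example.

```python
import re

# Matches plain integers and decimals; deliberately ignores units, percentages
# and scientific notation to keep the sketch short.
NUMBER = re.compile(r"-?\d+(?:\.\d+)?")


def untraceable_numbers(draft, evidence_values, tol=1e-9):
    """Return the numeric tokens in `draft` that match no value in the evidence."""
    flagged = []
    for token in NUMBER.findall(draft):
        value = float(token)
        if not any(abs(value - e) <= tol for e in evidence_values):
            flagged.append(token)
    return flagged


draft = "Mean expression was 4.2 across 37 samples (p = 0.03)."
print(untraceable_numbers(draft, evidence_values=[4.2, 37]))  # ['0.03']
```

A real reviewer would need to know where each evidence value came from (a query result, a computed statistic, a cited table), which is the hard part this sketch leaves out.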

by kfse

0 subcomment

I've worked with similar tools and while they're impressive, it's too often the case that the LLM literally makes up fake but realistic looking data and pretends that it's real. This includes pretty deep fakery like setting up mock database connectors so that it looks like you're fetching data from the right place, but it's just getting synthetic data.
How does this guard against that?

by raphman

2 subcomments

tl;dr: Use this if you don't like doing science or doing things well. It hallucinates references.
Seems to be based on https://github.com/swaruplab/operon as evidenced by the authorization dialog and https://x.com/testingcatalog/status/2037684573161783373 .
Mostly targeted at life sciences - e.g. integration for FDA, PubMed, genomics databases but no ACM / IEEE as far as I can tell.
Edit: arXiv search seems to be supported - but not Google Scholar etc. So, this tool is of little use for most researchers outside life sciences.
Edit 2: Quick walkthrough: the AppImage starts a browser window with an onboarding wizard and a chat interface. It suggests a few things one might do at the start of a research project - e.g. do a quick literature review. When I chose that option, it wrote Python scripts that used MCP calls to do arXiv searches. It stayed seemingly stuck there for a few minutes, not returning anything. Then:
> The free-text search returned too much noise
Claude decided to choose a certain paper as a starting point for further research. Shortly afterwards:
> That DOI resolved to the wrong paper. Let me find the correct anchor papers by title/author search directly.
Then it meandered a few more minutes doing research and creating a citation graph (that it did not show to me).
> I have a complete picture. Let me verify the key DOIs resolve and then write the review.
Then:
> The lint flags em-dash overuse. Let me reduce them, then save.
Then: a nice but verbose literature overview of my chosen topic
<blink>BUT it includes at least one hallucinated reference!</blink>
P.S.: What does this mean?
```
  [reviewer] verifier_mode=default-on downgraded to off: pro subscription tier, autoReviewer withheld (frame=f2a81cb2)
```

by immmmmm

1 subcomments

When I was doing my PhD, around 2 decades ago, I was often going to the library’s compactus to fish for a Phys Rev from the 80s. Back then papers were sparse and expensive. But the quality!
The Higgs boson is 3 papers, 6 authors and 6 pages in total!
At the end of my PhD, 30+ page slop papers were the norm.
Nowadays, well..
The paper by Higgs was one page. The guy probably published less than a hundred pages in his career.
One reason that made me abandon a career was the disgust caused by the publishing frenzy.
And now tokens..

by cmiles8

2 subcomments

Science isn’t suffering from a lack of papers. It’s suffering from a lack of good papers. Making it easier to just pump out paper-mill publications is about the last thing science needs right now.

by jszymborski

1 subcomments

Any other researchers paranoid about using LLMs for fear of them using your data and front running your publications/work?
Or incorporating it in training data and then spitting it out to a competing lab?

by ChrisArchitect

0 subcomment

Blog post: https://www.anthropic.com/news/claude-science-ai-workbench

by zftnb666

0 subcomment

Claude Science: $200/mo. Me, a scientist: copy-pasting into Claude and saying "I did the analysis."

by cowpig

0 subcomment

I've always found that what science is really lacking is closed, proprietary ecosystems trying to build for-profit moats around research.
Thank our lords at Anthropic for stepping into this void

by Alexadar

0 subcomment

Interesting to test. I set up all scientific subroutines with Claude Code-generated automation and visualization. Honestly, I think that this product would not be a fit for all, given the diversity of scientific tasks.

by chazeon

0 subcomment

Isn't this the company that makes the LLM become a degenerate when it comes to bioscience?

by jkwang

0 subcomment

Claude Science sounds like a useful shift toward reproducible agentic research. The built-in error recovery and tool orchestration could make it practical for real lab workflows, not just demos.

by trallnag

0 subcomment

"Pre-configured for your domain [...] cheminformatics" as in something like ChEMBL?

by alpineman

0 subcomment

So that's why Fable was refusing those biology questions

by hooloovoo_zoo

0 subcomment

Doesn’t seem like much value-add beyond pointing Claude Code at org mode.

by stanford_labrat

0 subcomment

impressive to me, but sadly i feel a little misleading since this is only the data-science part of life sciences.
every few weeks though i test claude and chatgpt on their scientific reasoning and it has definitely improved over time. in my experience without specific instruction on what is known/unknown they typically are lagging behind the leading edge of the field (dev bio/pluripotency in my case). probably because scientific research articles are not open-source so they can't crawl them.
claude has definitely outperformed chatgpt in this regard, however; its scientific reasoning is impressive.

by jerven

0 subcomment

Working on the UniProt services that might be used from the connector, it would be nice to learn if this uses public resources or if there is a private Anthropic copy of certain UniProt data sets.

by JoshGlazebrook

4 subcomments

The fact that we are coming up on a month of Fable being unavailable with essentially zero actual signal from Anthropic around when it may be back is crazy to me. Yet still we have these random new products coming out?

by dmezzetti

0 subcomment

Why does HN let OpenAI and Anthropic basically advertise but it throws down the gauntlet at a small developer like myself when we do "self promotion"?
Top 3 posts as of this moment are all about Claude.

by ai_fry_ur_brain

1 subcomments

Why would you people ever use this company's products? They're actually evil and are trying to scam you and/or make you unemployable/worthless. You people really gotta wake up.

by khurs

1 subcomments

Big Pharma = Big Budgets.
So targeting them with a tailored product is understandable.

by jvanderbot

0 subcomment

Thought I'd give it a whirl - crashed immediately.
I was tickled they had a "Download for linux" button prominently shown, but nothing yet.

by nickandbro

0 subcomment

So I guess they released this instead of Sonnet 5?

by domrdy

2 subcomments

It has Sonnet 5 as a usable model. Interesting.

by fastaguy88

0 subcomment

Download for Mac. Find out I need a different subscription. Cannot quit program (must force quit).
Perhaps I need AI to use it.

by trevor519

0 subcomment

They are offering this free to iGEM students, which is really cool

by theplumber

0 subcomment

They forgot to include an example of prompt error on “cancer” with Fable in that “nice” video.

by nmilo

0 subcomment

> Inspect proteins, alignments, genomic tracks, chemical structures, and PDFs in their native form, with no extra installation required.
I like how this implies parsing PDFs is as hard as like protein folding

by zmmmmm

0 subcomment

I can't decide if this will make science better or be the death of it. The potential wave of slop about to hit journals is frightening. Essentially what happened with GitHub code reviews is about to hit academic peer reviews and it isn't going to be pretty.

by imdsm

0 subcomment

Weird that it runs as a local webserver rather than as an app

by woadwarrior01

1 subcomments

Looks like Cursor and JupyterLab had a baby.

by Aeroi

0 subcomment

what happens when anthropic launches products for every vertical?

by tripleee

1 subcomments

maxed out on coding improvements so now they're trying to expand to other markets

by cute_boi

0 subcomment

what's up with all these samosas? Samosa Manuscript, Samosa Benchmarking?

by game_the0ry

1 subcomments

Disappointing that science came after cowork. Shows how their priorities are profitability first and helping humanity second.

by bozdemir

0 subcomment

Another overrated packaged workspace to drain more usage... No thank you.

by CamperBob2

0 subcomment

Claude: "Not that science"

by botfriendsarent

0 subcomment

Dude! Give me some stolen science!

by calldacopsidgaf

1 subcomments

this is a great application for the sycophantic, non-deterministic lying machine!

by Retr0id

0 subcomment

> every step from data wrangling to *publication*
Do they have no shame?
Edit: seems like no https://news.ycombinator.com/item?id=48736814

by bigyabai

1 subcomments

How about no?
AI brand identity has made the unfortunate pivot to "how much do you trust us" which is going to be a real race to the bottom. I don't want LLMs managing nuclear reactors or replacing junior lab technicians. I don't trust any of these LLMs to do the bare minimum, regardless of how good it is for your brand.
It's gross watching these stunts unfold. Next ChatGPT will fly a passenger jet, which Claude will one-up with an agentic surgery, which OpenAI will respond to by putting a humanoid robot on the moon. If this is what 21st century market competition looks like, we are all fucked.