{"id":1431,"date":"2026-06-17T12:34:33","date_gmt":"2026-06-17T12:34:33","guid":{"rendered":"https:\/\/americanhomejournals.com\/?p=1431"},"modified":"2026-06-17T12:34:33","modified_gmt":"2026-06-17T12:34:33","slug":"one-of-legals-hottest-startups-is-helping-lawyers-finally-answer-is-the-ais-work-any-good","status":"publish","type":"post","link":"https:\/\/americanhomejournals.com\/?p=1431","title":{"rendered":"One of legal&#8217;s hottest startups is helping lawyers finally answer: Is the AI&#8217;s work any good?"},"content":{"rendered":"<section>\n<p>Legal technology wants its vibe-coding moment. But first, it has to prove the tools can think like a lawyer.<\/p>\n<p>Read more <a href=\"https:\/\/americanhomejournals.com\/?p=1429\">The next office power struggle: AI tokens<\/a><\/p>\n<p>Taking up the task is Crosby, a startup-meets-law-firm that sells basic legal services to companies, including Cursor and Rogo. On Wednesday, it released the Redline Bench, a tool built to measure how well artificial intelligence models perform real-world legal tasks, starting with contract review.<\/p>\n<p>Software engineers have spent the past few years watching these systems get shockingly good at writing code and debugging errors. Now legal tech companies are chasing a similar prize: artificial intelligence that can review contracts, spot risks, and haggle terms faster and cheaper than lawyers.<\/p>\n<p>But law has a problem that coding does not, says Ryan Daniels, a former in-house lawyer turned Crosby founder. &#8220;It&#8217;s really hard to define &#8216;good&#8217; or &#8216;bad,'&#8221; he said.<\/p>\n<p>Models can write code that either runs or breaks. Legal work is a murkier target. A sales contract can be edited, or &#8220;redlined,&#8221; in lots of defensible ways, Daniels explains. A change that one lawyer sees as prudent, another might call too aggressive.<\/p>\n<p>That ambiguity has become a headache for companies racing to automate legal work, from the scrappy neofirms to the model labs themselves. Anthropic has spent the past few months courting in-house lawyers with tools built for them. That push has been closely watched by investors. Earlier this year, Anthropic&#8217;s new legal plugin stirred a sell-off in legal tech stocks.<\/p>\n<p>Benchmarks are one of the main ways companies track progress. The labs building frontier models use them as stress tests, measuring whether a new system is better at tasks than the last one.<\/p>\n<p>Coding has hundreds of benchmarks for evaluating models. But the legal industry still lacks a shared way to answer the question: Is the AI&#8217;s work any good?<\/p>\n<p>Crosby has been working on a new yardstick. The company pulled its engineers and lawyers into a tactical unit called Crosby Intelligence to build agents for Crosby&#8217;s law firm and a benchmark to grade them against. That team includes engineer Sharan Ramjee, who worked on transformer models to sniff out fraud at Stripe, and Ross Weiser, a lawyer who joined from elite law firm Sullivan &amp; Cromwell.<\/p>\n<p>Crosby also partnered with Micro1, a company that helps model-makers recruit expert workers, to find more lawyers who could help define what counts as good legal work.<\/p>\n<p>Read more <a href=\"https:\/\/americanhomejournals.com\/?p=1426\">Allbirds is now Smartbirds, and its AI-focused CEO says people won\u2019t even remember the shoes\u2019<\/a><\/p>\n<p>To build the benchmark, senior lawyers simulated software deals and marked the contract changes they considered most important at each stage of the negotiation. Those changes were turned into weighted criteria.<\/p>\n<p>When Crosby runs a new test, it gives models the same contracts and asks them to make their own edits. Then a panel of three judges compares these redlines with the lawyer-built rubric. The judges vote pass or fail on each item, and the final score shows how often the models made the kinds of edits that lawyers considered important.<\/p>\n<p>Redline Bench will be made public so any lab can put its models through Crosby&#8217;s paces. Crosby also plans to regularly release reports tracking how major models compare.<\/p>\n<p>The first release of the Redline Bench put ChatGPT 5.5 at the top of the heap, with a score of 50.5%, meaning the model&#8217;s redlines matched half of the edits that lawyers prioritized. Gemini 3.5 Flash followed at 45.1%, and Claude Opus 4.8 scored 44.4%.<\/p>\n<p>Crosby was able to test Anthropic&#8217;s highly capable new model, Fable 5, only once before Anthropic pulled it off the shelves. The results were promising, with a score of 47.3%. When access is restored, Crosby will run the benchmark again and update it.<\/p>\n<p>Crosby isn&#8217;t the only company trying to measure how the models stack up. Harvey, one of the best-funded legal startups, has released benchmarks for case law research and contract review.<\/p>\n<p>Anthropic and OpenAI also build their own benchmarks to measure performance on real-world tasks. But Daniels said those results can be hard to trust. Over time, the labs eventually tune their systems to perform well on their own tests, he said.<\/p>\n<p>The stakes are bigger than a scoreboard. Billions of investment dollars are riding on the promise that artificial intelligence can lower legal bills and absorb work that used to pile up on the general counsel&#8217;s desk.<\/p>\n<p>Lawyers will only use the tools if they trust them. Crosby wants to give them a reason to.<\/p>\n<p>Read more <a href=\"https:\/\/americanhomejournals.com\/?p=1424\">Bose is becoming a media company<\/a><\/p>\n<\/section>\n","protected":false},"excerpt":{"rendered":"<p>Legal AI startups say their tools can absorb routine work. Crosby is releasing a benchmark to test whether lawyers should trust them.<\/p>\n","protected":false},"author":1,"featured_media":1430,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[4],"tags":[],"class_list":["post-1431","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-tech"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.6 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>One of legal&#039;s hottest startups is helping lawyers finally answer: Is the AI&#039;s work any good? - American home journals<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/americanhomejournals.com\/?p=1431\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"One of legal&#039;s hottest startups is helping lawyers finally answer: Is the AI&#039;s work any good? - American home journals\" \/>\n<meta property=\"og:description\" content=\"Legal AI startups say their tools can absorb routine work. Crosby is releasing a benchmark to test whether lawyers should trust them.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/americanhomejournals.com\/?p=1431\" \/>\n<meta property=\"og:site_name\" content=\"American home journals\" \/>\n<meta property=\"article:published_time\" content=\"2026-06-17T12:34:33+00:00\" \/>\n<meta name=\"author\" content=\"admin\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"admin\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"4 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\\\/\\\/schema.org\",\"@graph\":[{\"@type\":\"Article\",\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431#article\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431\"},\"author\":{\"name\":\"admin\",\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/#\\\/schema\\\/person\\\/a3bb30439d8074c26cddd2e4af7af957\"},\"headline\":\"One of legal&#8217;s hottest startups is helping lawyers finally answer: Is the AI&#8217;s work any good?\",\"datePublished\":\"2026-06-17T12:34:33+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431\"},\"wordCount\":779,\"commentCount\":0,\"image\":{\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/americanhomejournals.com\\\/wp-content\\\/uploads\\\/2026\\\/06\\\/642ffa7b56f8072040006571438ebf76.webp\",\"articleSection\":[\"Tech\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431\",\"url\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431\",\"name\":\"One of legal's hottest startups is helping lawyers finally answer: Is the AI's work any good? - American home journals\",\"isPartOf\":{\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431#primaryimage\"},\"image\":{\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431#primaryimage\"},\"thumbnailUrl\":\"https:\\\/\\\/americanhomejournals.com\\\/wp-content\\\/uploads\\\/2026\\\/06\\\/642ffa7b56f8072040006571438ebf76.webp\",\"datePublished\":\"2026-06-17T12:34:33+00:00\",\"author\":{\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/#\\\/schema\\\/person\\\/a3bb30439d8074c26cddd2e4af7af957\"},\"breadcrumb\":{\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431#primaryimage\",\"url\":\"https:\\\/\\\/americanhomejournals.com\\\/wp-content\\\/uploads\\\/2026\\\/06\\\/642ffa7b56f8072040006571438ebf76.webp\",\"contentUrl\":\"https:\\\/\\\/americanhomejournals.com\\\/wp-content\\\/uploads\\\/2026\\\/06\\\/642ffa7b56f8072040006571438ebf76.webp\",\"width\":1200,\"height\":600},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/?p=1431#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\\\/\\\/americanhomejournals.com\\\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"One of legal&#8217;s hottest startups is helping lawyers finally answer: Is the AI&#8217;s work any good?\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/#website\",\"url\":\"https:\\\/\\\/americanhomejournals.com\\\/\",\"name\":\"American home journals\",\"description\":\"\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\\\/\\\/americanhomejournals.com\\\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\\\/\\\/americanhomejournals.com\\\/#\\\/schema\\\/person\\\/a3bb30439d8074c26cddd2e4af7af957\",\"name\":\"admin\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g\",\"url\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g\",\"contentUrl\":\"https:\\\/\\\/secure.gravatar.com\\\/avatar\\\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g\",\"caption\":\"admin\"},\"sameAs\":[\"http:\\\/\\\/americanhomejournals.com\"],\"url\":\"https:\\\/\\\/americanhomejournals.com\\\/?author=1\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"One of legal's hottest startups is helping lawyers finally answer: Is the AI's work any good? - American home journals","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/americanhomejournals.com\/?p=1431","og_locale":"en_US","og_type":"article","og_title":"One of legal's hottest startups is helping lawyers finally answer: Is the AI's work any good? - American home journals","og_description":"Legal AI startups say their tools can absorb routine work. Crosby is releasing a benchmark to test whether lawyers should trust them.","og_url":"https:\/\/americanhomejournals.com\/?p=1431","og_site_name":"American home journals","article_published_time":"2026-06-17T12:34:33+00:00","author":"admin","twitter_card":"summary_large_image","twitter_misc":{"Written by":"admin","Est. reading time":"4 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/americanhomejournals.com\/?p=1431#article","isPartOf":{"@id":"https:\/\/americanhomejournals.com\/?p=1431"},"author":{"name":"admin","@id":"https:\/\/americanhomejournals.com\/#\/schema\/person\/a3bb30439d8074c26cddd2e4af7af957"},"headline":"One of legal&#8217;s hottest startups is helping lawyers finally answer: Is the AI&#8217;s work any good?","datePublished":"2026-06-17T12:34:33+00:00","mainEntityOfPage":{"@id":"https:\/\/americanhomejournals.com\/?p=1431"},"wordCount":779,"commentCount":0,"image":{"@id":"https:\/\/americanhomejournals.com\/?p=1431#primaryimage"},"thumbnailUrl":"https:\/\/americanhomejournals.com\/wp-content\/uploads\/2026\/06\/642ffa7b56f8072040006571438ebf76.webp","articleSection":["Tech"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/americanhomejournals.com\/?p=1431#respond"]}]},{"@type":"WebPage","@id":"https:\/\/americanhomejournals.com\/?p=1431","url":"https:\/\/americanhomejournals.com\/?p=1431","name":"One of legal's hottest startups is helping lawyers finally answer: Is the AI's work any good? - American home journals","isPartOf":{"@id":"https:\/\/americanhomejournals.com\/#website"},"primaryImageOfPage":{"@id":"https:\/\/americanhomejournals.com\/?p=1431#primaryimage"},"image":{"@id":"https:\/\/americanhomejournals.com\/?p=1431#primaryimage"},"thumbnailUrl":"https:\/\/americanhomejournals.com\/wp-content\/uploads\/2026\/06\/642ffa7b56f8072040006571438ebf76.webp","datePublished":"2026-06-17T12:34:33+00:00","author":{"@id":"https:\/\/americanhomejournals.com\/#\/schema\/person\/a3bb30439d8074c26cddd2e4af7af957"},"breadcrumb":{"@id":"https:\/\/americanhomejournals.com\/?p=1431#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/americanhomejournals.com\/?p=1431"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/americanhomejournals.com\/?p=1431#primaryimage","url":"https:\/\/americanhomejournals.com\/wp-content\/uploads\/2026\/06\/642ffa7b56f8072040006571438ebf76.webp","contentUrl":"https:\/\/americanhomejournals.com\/wp-content\/uploads\/2026\/06\/642ffa7b56f8072040006571438ebf76.webp","width":1200,"height":600},{"@type":"BreadcrumbList","@id":"https:\/\/americanhomejournals.com\/?p=1431#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/americanhomejournals.com\/"},{"@type":"ListItem","position":2,"name":"One of legal&#8217;s hottest startups is helping lawyers finally answer: Is the AI&#8217;s work any good?"}]},{"@type":"WebSite","@id":"https:\/\/americanhomejournals.com\/#website","url":"https:\/\/americanhomejournals.com\/","name":"American home journals","description":"","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/americanhomejournals.com\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/americanhomejournals.com\/#\/schema\/person\/a3bb30439d8074c26cddd2e4af7af957","name":"admin","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/50b1ad2e498f523425ee0a8cc5180a210646db1622662a3d56cc405d3e0c346a?s=96&d=mm&r=g","caption":"admin"},"sameAs":["http:\/\/americanhomejournals.com"],"url":"https:\/\/americanhomejournals.com\/?author=1"}]}},"_links":{"self":[{"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=\/wp\/v2\/posts\/1431","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1431"}],"version-history":[{"count":0,"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=\/wp\/v2\/posts\/1431\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=\/wp\/v2\/media\/1430"}],"wp:attachment":[{"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1431"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1431"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/americanhomejournals.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1431"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}