In September’s edition of Third Thursday, Erin Sussman and Jeffrey Solomon joined us to share their key insights on developing strong billing guidelines:
1. Start with industry standards. Leverage trusted industry resources as a baseline to understand what’s considered “market standard.” Partners and advisors can also provide guidance on best practices for communication, rollout, and firm acceptance.
2. Conduct a historical analysis. Look back at past invoices to spot trends and problem areas where guidelines could help contain costs. Common examples include:
Excessive hours billed for legal research
High or poorly documented travel expenses
Overstaffing or disproportionate partner time
3. Engage key in-house legal team members. Engage key members of your legal department when drafting guidelines. Their firsthand insight into your matters and law firms’ billing behaviors will help ensure your policies are both relevant and enforceable. Just as important, their involvement fosters ownership and buy-in, which makes enforcement smoother.
How to take advantage of AI in the bill review process:
4. Combine AI with human insight. AI is powerful at spotting patterns and exceptions, but pairing it with human judgment creates even more flexibility in controlling nuanced or complex legal spend. This frees your team to spend less time policing invoices and more time practicing law.
5. Monitor which rules the AI flags most frequently. Keep track of which rules your AI solution flags most often. Are certain system-applied flags popping up again and again? That trend data can help you refine your guidelines, improve communication with firms, and sharpen your cost-control strategies.
With the right mix of industry benchmarks, internal input, and AI-driven review, billing guidelines become more than rules; they serve as a strategic lever to reduce costs, drive consistency, and strengthen firm relationships.
Presenters
Erin Sussman General Counsel, Director of Business Development and Client Service Sterling Analytics
MIT grabbed headlines recently by reporting that 95 percent of Generative AI pilots never drive measurable results, with only 5 percent of projects achieving real scale. Every stalled pilot drains budget, slows transformation, builds opposition, and piles pressure on legal ops leaders to prove ROI. Below, we examine what lessons from the study can be applied to your legal operations AI strategy:
As Nick Whitehouse, Chief AI Officer at Onit, explained in a recent response to the study: “Are 95% of generative AI projects failing? Nope, not really. The way we are measuring them and the way we are running them, yes, but the technology itself, no.”
Why Generative AI Pilots Fail in Legal Departments
AI that is disconnected from workflow
The MIT study found that the leading reason generative AI pilots fail is weak integration with workflows, not weak technology. If AI doesn’t live inside e-Billing, matter management, or contract review, it lives on the sidelines. Integration reduces the barriers to adoption, and adoption is impact. This is exactly why modern legal ops software that is built to connect workflows matter.
Chasing sizzle over substance
The same research showed that companies spend heavily on high-profile pilots in sales and marketing, while the biggest ROI sits in back-office automation. In legal, it can be tempting to tackle impressive use cases like advice giving and negotiation strategy. However, AI can deliver more value faster for invoice review, spend management, and contract analytics than “showcase” projects.
As Whitehouse noted, “The MIT study was actually helpful: it found the biggest wins often come from focusing on back-office tasks. These are well documented, process-driven, and usually have the data you need to measure results. That’s where success tends to show up first.”
The build versus buy dilemma
The difference between success for both build or buy is focus. Analysts noted that vendor tools succeed about twice as often. Internal builds can work, but only if they are targeted and resourced properly. General purpose AI models still require significant tuning, configuring, custom UX, and most importantly, scientific validation to maximize their value in a particular domain. This requires a domain specific focus and attention.
Tackling too much at once
Spreading resources too thin is another culprit. Startups focusing on one use case often scale quickly, reaching $20 million in revenue in under a year. Larger enterprises tend to pilot too many initiatives at once and struggle to show meaningful progress. Legal ops leaders who pick one workflow, deliver results, and scale from there build momentum and credibility.
As Onit’s Chief AI Officer pointed out, “About 5% of businesses in the study are doing an exceptionally good job using AI to solve real pain points. You see this especially in the startup ecosystem. In fact, the study showed that businesses working with specialized companies or products focused on these problems have twice the success rate compared to those going it alone.”
How to Make Sure Your Legal Ops Project is Among the 5% of Success Stories
Meanwhile, adoption pressure is only growing. A recent survey found that 96 percent of legal professionals already use AI in their daily workflows, and nearly half describe it as essential. Legal cannot afford to treat AI as optional. However, moving from experimentation to execution requires a structured approach.
Here’s how to set generative AI pilots up for success in legal ops:
Pinpoint one high-impact process
Look for workflows where time and cost converge, like invoice review or contract approvals.
Embed AI into the systems your team already use
If it isn’t integrated, it won’t be adopted. Seamless connection to your core legal tools is non-negotiable.
Measure what matters
Cycle-time reductions, outside counsel savings, compliance improvements. These are the metrics leadership cares about, not abstract productivity gains.
Fund what works
Double down on pilots that deliver measurable ROI and resist the urge to scatter budget across experiments.
Lead the adoption curve
Change management is not an afterthought. Equip your team with training, communicate wins, and build trust in the tools.
From Pilot Failure to Legal Ops Success
Loose generative AI pilots are failing at high rates, but that doesn’t mean legal ops is destined for the same fate. Infosys research shows that 95 percent of executives using AI have experienced a mishap, and only 2 percent of organizations meet responsible AI standards. Like the MIT study, that’s an explicit argument for structure, governance, and discipline.
“The right way to look at this is as a maturity journey: moving from ad hoc experiments to fully integrated AI in your core processes. That’s when you’ll start seeing massive success.” – Nick Whitehouse
Legal ops leaders who focus on integration, measure what matters, and scale intentionally will be the ones who prove AI’s real value.
We turned the key lessons from MIT’s AI pilot study into a quick summary slide.