There is no shortage of AI benchmarks in the market today, with popular options like Humanity's Last Exam (HLE), ARC-AGI-2 and GDPval, among numerous others. AI agents excel at solving abstract math ...
New Agent Bricks features — Agent-as-a-Judge, Tunable Judges, and Judge Builder — are designed to help enterprises fine-tune agent performance and align AI behavior with business-specific standards.
Databricks Inc. today announced a series of updates to its flagship artificial intelligence product, Agent Bricks, aimed at improving governance, accuracy and model flexibility for enterprise AI ...
Databricks finds itself in an awkward situation following the departure of Naveen Rao, its head of artificial intelligence, as rivals like Snowflake, Teradata, and hyperscalers such as AWS, Azure, and ...