Insights

Research, analysis, and perspectives from Bommarito Consulting on AI, governance, privacy, and computational modeling.

AI Governance 2024-11-14 · 2 min

EU AI Code of Practice Draft: What Boards Should Know

An analysis of the EU AI Office's draft General-Purpose AI Code of Practice, highlighting systemic risk oversight, documentation requirements, incident response, and board-level compliance considerations.

AI Governance 2024-10-11 · 2 min

AI Lifecycle and the Board's Role

A primer for board directors on the AI lifecycle — data collection, training, and deployment — and the strategic considerations boards must understand for effective AI oversight.

AI Governance 2024-10-10 · 2 min

AI Oversight: 5 Key Sources of Board Requirements

A framework identifying the five key sources of AI governance requirements for boards — legal mandates, risk frameworks, insurance, internal policies, and customer preferences.

AI Governance 2024-10-09 · 2 min

Risk Management for AI: A Board Director's Guide

A comprehensive guide for board directors on leading AI risk management through six key elements: establishing context, risk assessment, risk treatment, recording, communication, and continuous monitoring.

Open Source 2024-09-01 · 2 min

The KL3M Data Project: Building Copyright-Clean Training Data at Scale

Inside the KL3M Data Project — assembling 132M+ copyright-clean documents to enable responsible LLM training and earning the first 'Fairly Trained' certification.

AI Governance 2022-03-30 · 2 min

How Data Provenance Drives Machine Learning Risk and Value

An exploration of data provenance — the origin and history of data — and why it is a board-level concern for AI risk management, legal compliance, and responsible governance.

AI Governance 2021-10-18 · 2 min

Pioneering Responsible Data Science: A Framework for Ethical Innovation

Introducing an open-source Responsible Data Science Policy Framework designed to help organizations address ethical AI governance through a modular, adaptable approach.

Firm Update 2019-01-02 · 1 min

Since Our Last Episode: The Evolution of Bommarito Consulting

A retrospective on the firm's journey from early-stage consulting through the LexPredict era, and the pivot toward AI governance, privacy, and institutional advisory.

Entrepreneurship 2018-06-15 · 2 min

Built to Sell: Lessons from the LexPredict Journey

Key takeaways from building, scaling, and successfully exiting LexPredict — an AI-powered legal technology company acquired in 2018.

Research 2017-03-10 · 2 min

Predicting the Supreme Court: AI Meets Legal Outcomes

How our machine learning research achieved breakthrough results in predicting Supreme Court decisions, and what it means for the future of legal AI.

Research 2015-01-27 · 1 min

Course Material for Complex Systems 530 — Computer Modeling for Complex Systems

Open course material for Complex Systems 530 at the University of Michigan, covering agent-based, Monte Carlo, and network modeling in Python.

Research 2014-07-27 · 2 min

Predicting the Behavior of the Supreme Court of the United States: A General Approach

Introducing our Supreme Court prediction project using extremely randomized trees to forecast over sixty years of decisions and individual justice votes.

Research 2014-06-03 · 2 min

Featured in Wired: Measuring the Complexity of the Law

Sam Arbesman featured our paper on measuring legal complexity of the United States Code on his Wired Science blog, the Social Dimension.

Research 2013-08-19 · 2 min

Is the Tax Code the Longest Title?

An analysis of whether the Internal Revenue Code (Title 26) is actually the longest Title in the U.S. Code, using data from our paper on measuring legal complexity.

Research 2013-08-13 · 2 min

Measuring the Complexity of the Law: The U.S. Code

Releasing our empirical framework for measuring legal complexity, applied to the U.S. Code, with full replication source and data on GitHub.

Research 2012-06-17 · 2 min

Summary of Community Detection Algorithms in igraph 0.6

A reference guide to the community detection algorithms available in igraph 0.6, including their runtime complexity, and support for directed and weighted edges.

Research 2012-04-15 · 3 min

Building an AWS CloudSearch Domain for the Supreme Court

A step-by-step tutorial for building a fully searchable AWS CloudSearch domain using public domain U.S. Supreme Court decisions, covering data acquisition, domain configuration, indexing, and querying.

Research 2012-02-13 · 1 min

Statistics on the Length and Linguistic Complexity of Bills

An introduction to an automated daily analysis of 112th Congress bills, providing statistics on section count, word count, unique words, and Flesch-Kincaid reading level for every bill.

Research 2011-12-16 · 4 min

Building Legal Language Explorer: Interactivity and Drill-Down, noSQL and SQL

A technical deep-dive into the architecture of the Legal Language Explorer, a Google Ngrams-style viewer for the U.S. Supreme Court corpus, combining redis (noSQL) for fast time series queries with PostgreSQL for case-level drill-down.

Research 2011-11-13 · 2 min

21st Century Legal Informatics: Part 1, Introduction

An introduction to three paradigms of legal informatics — 20th century computers-as-libraries, 22nd century computers-as-lawyers, and the practical 21st century middle ground.

Research 2011-05-04 · 3 min

Historical Data Mining the Supreme Court Headnotes

A demonstration of historical data mining techniques using editorially-assigned legal headnotes from U.S. Supreme Court cases, examining co-occurrence and citation networks to trace the evolution of legal concepts like criminal suspect rights.

Research 2011-04-18 · 3 min

Two New Papers on SSRN: Measuring EU Integration Through Sovereign Debt & Exploring Relationships Between Headnotes in the Supreme Court

Announcing two working papers released on SSRN: one measuring European integration through sovereign bond yield correlations from 1872 to 2010, and another exploring the network structure of legal concepts in Supreme Court headnotes.

Research 2011-04-15 · 2 min

Now in Print: An Empirical Survey of the Population of U.S. Tax Court Written Decisions

Announcing the publication of an empirical analysis of U.S. Tax Court citation practices from 1990 to 2008 in the Virginia Tax Review, examining Internal Revenue Code citation frequency and the impact of tax legislation on court decisions.

Research 2011-04-10 · 2 min

Building a Better Legal Search Engine, Part 1: Searching the U.S. Code

An introduction to indexing and searching the U.S. Code using Apache Lucene, structured public domain data, and open source software.

Let's Work Together

We'd welcome the opportunity to discuss how we can help your organization navigate the intersection of technology, governance, and strategy.

Insights

EU AI Code of Practice Draft: What Boards Should Know

AI Lifecycle and the Board's Role

AI Oversight: 5 Key Sources of Board Requirements

Risk Management for AI: A Board Director's Guide

The KL3M Data Project: Building Copyright-Clean Training Data at Scale

How Data Provenance Drives Machine Learning Risk and Value

Pioneering Responsible Data Science: A Framework for Ethical Innovation

Since Our Last Episode: The Evolution of Bommarito Consulting

Built to Sell: Lessons from the LexPredict Journey

Predicting the Supreme Court: AI Meets Legal Outcomes

Course Material for Complex Systems 530 — Computer Modeling for Complex Systems

Predicting the Behavior of the Supreme Court of the United States: A General Approach

Featured in Wired: Measuring the Complexity of the Law

Is the Tax Code the Longest Title?

Measuring the Complexity of the Law: The U.S. Code

Summary of Community Detection Algorithms in igraph 0.6

Building an AWS CloudSearch Domain for the Supreme Court

Statistics on the Length and Linguistic Complexity of Bills

Building Legal Language Explorer: Interactivity and Drill-Down, noSQL and SQL

21st Century Legal Informatics: Part 1, Introduction

Historical Data Mining the Supreme Court Headnotes

Two New Papers on SSRN: Measuring EU Integration Through Sovereign Debt & Exploring Relationships Between Headnotes in the Supreme Court

Now in Print: An Empirical Survey of the Population of U.S. Tax Court Written Decisions

Building a Better Legal Search Engine, Part 1: Searching the U.S. Code

Archive

Advanced Approximate Sentence Matching in Python

Fuzzy Match Sentences in Python

Isotonic Regressions in scikit-learn

Is There Tax in the Cloud?

Law's Future from Finance's Past: Recorded Talk from Reinvent Law Silicon Valley

Connecting R to an Oracle Database with RJDBC

eDiscovery Consulting in the Cloud: Searching an Outlook Mailbox and Attachments

“Google” for Subpoenaed Emails: AWS CloudSearch for eDiscovery

Saving Memory in Redis and Python with struct.pack

Tracking the Frequency of Twitter Hashtags with R

Plotting 3D Graphs with Python, igraph, and Cairo

Let's Work Together