Operationalize
the TTB COLA Registry

COLA Cloud does the data dirty work
so you can focus on building differentiated value.

Data Product

COLA Cloud captures, cleans and enriches the TTB's Public COLA Registry and delivers the data to your data warehouse. Extensive enrichments provide commerce-oriented features that are absent in the public registry. 

Our pipeline leverages best in class image and language processing tools like Google Vision AI and OpenAI's Chat GPT to extract dozens of additional product features directly from label images.

Label images are available via AWS S3.

SELECT
 colas.ttb_id,
 colas.brand_name,
 colas.front_label_text,
 colas.llm_tasting_note_flavors
FROM colas

Screenshot of the COLA Cloud web app featuring a search panel and a grid of search results each featuring a label image and title.

Web App

Go hands-on in the COLA Cloud web app and explore the dataset interactively. Configure saved searches to keep tabs on the market niches that matter to your business.

Free accounts gain demo access to 160K records from the year 2018. Subscribe to access the full catalog, updated daily.

Powered by the same dataset in the data product.

COLA Cloud App
2.6M+
Label Applications
4.6M+
Label Images
470K+
Unique Barcodes
3K / week
New COLA Approvals

Intro to the TTB Public COLA Registry

The Alcohol and Tobacco Tax and Trade Bureau (TTB) ensures the compliance of alcohol producers with federal labeling, production, and distribution laws.

The TTB's Certificate of Label Approval (COLA) process requires producers to submit prospective product labels for any beverages sold across state lines in order to verify their legibility, marketing claims, health warnings, and other details.

The Public COLA Registry is a (clunky) public access portal where users can search and access COLA submissions. It contains millions of records, and adds a few thousand records every week. Each submission contains some limited structured information about the product, along with the submitted label image files.

COLA Cloud captures all these records, extracts additional data points, and produces a remastered dataset in the cloud, ready to integrate to your business systems.

A set of federal forms for Certificate of Label Approval applications with the TTB's logo superimposed.

Enterprise Grade

COLA Cloud was developed by an ex-Drizly, ex-Gopuff (Bevmo) Analytics Engineer with years of experience supporting Analysts and Data Scientists with solid and ergonomic datasets.

A set of federal forms for Certificate of Label Approval applications with the TTB's logo superimposed.
A product label with every area of text boxed in various highlighter colors by a computer vision algorithm.
A lineage graph of data transformations with the logo for dbt, a popular data transformation framework, in the bottom corner.
A database entity relationship diagram, or ERD, illustrating the relationships between data tables.

Let's connect

all fields are required

Thank you! We'll get back to you as soon as possible.
Hmm something about that didn't work out. You can also email me directly at jay@colacloud.us

Use Cases

Product Catalog

Maintaining a comprehensive catalog of alcohol products is a constant up-hill battle. COLA Cloud is a the largest automated input to catalog maintenance out there. With 450K unique barcodes already paired with deep product information, there are only a few steps to immediately and continuously enriching your catalog with this public dataset.

Supplier Lead Generation

Curious about what organic wines out of Oregon are coming to market in the next month? Or every brand that making Milkshake IPAs out of Vermont? COLA Cloud captures label approvals before products hit shelves. Query your way to promising supplier relationships, and stay up-to-date on the latest trends in any tiny dimension of the market.

Competitive Intelligence

Use the COLA Cloud web app to automate high fidelity insights into the domestic market. Dial in your search parameters, and save the search to revisit whenever you like. For more advanced analysis get the full dataset in Snowflake and perform arbitrary analytical queries.