COLA Cloud does the data dirty work
so you can focus on building differentiated value.
COLA Cloud captures, cleans and enriches the TTB's Public COLA Registry and delivers the data to your data warehouse. Extensive enrichments provide commerce-oriented features that are absent in the public registry.
Our pipeline leverages best-in-class image and language processing tools like Google Vision AI and OpenAI's ChatGPT to extract dozens of additional product features directly from label images.
Label images are available via AWS S3.
SELECT
    colas.ttb_id,
    colas.brand_name,
    colas.front_label_text,
    colas.llm_tasting_note_flavors
FROM colas
Go hands-on in the COLA Cloud web app and explore the dataset interactively. Configure saved searches to keep tabs on the market niches that matter to your business.
Free accounts gain demo access to 160K records from the year 2018. Subscribe to access the full catalog, updated daily.
The web app is powered by the same dataset as the data product.
The Alcohol and Tobacco Tax and Trade Bureau (TTB) ensures the compliance of alcohol producers with federal labeling, production, and distribution laws.
The TTB's Certificate of Label Approval (COLA) process requires producers to submit prospective product labels for any beverages sold across state lines in order to verify their legibility, marketing claims, health warnings, and other details.
The Public COLA Registry is a (clunky) public access portal where users can search and access COLA submissions. It contains millions of records, and adds a few thousand records every week. Each submission contains some limited structured information about the product, along with the submitted label image files.
COLA Cloud captures all these records, extracts additional data points, and produces a remastered dataset in the cloud, ready to integrate with your business systems.
COLA Cloud was developed by an ex-Drizly, ex-Gopuff (Bevmo) Analytics Engineer with years of experience supporting Analysts and Data Scientists with solid and ergonomic datasets.
Maintaining a comprehensive catalog of alcohol products is a constant uphill battle. COLA Cloud is the largest automated input to catalog maintenance out there. With 450K unique barcodes already paired with deep product information, it takes only a few steps to start enriching your catalog with this public dataset, immediately and continuously.
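As a rough illustration of barcode-based enrichment, the sketch below joins an in-house catalog to the COLA data on a shared barcode. It is a minimal example under stated assumptions, not COLA Cloud's documented schema: the `my_catalog` table and the `barcode` and `product_type` columns are hypothetical names, the rows are made up, and an in-memory SQLite database stands in for your warehouse. Only `ttb_id` and `brand_name` appear in the sample query on this page.

```python
import sqlite3

# In-memory SQLite stands in for a real warehouse; all rows are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical slice of the COLA dataset (barcode and product_type are assumed names).
    CREATE TABLE colas (ttb_id TEXT, brand_name TEXT, barcode TEXT, product_type TEXT);
    INSERT INTO colas VALUES
        ('T-001', 'Example Cellars', '000000000001', 'wine'),
        ('T-002', 'Sample Brewing',  '000000000002', 'malt beverage');

    -- Hypothetical in-house catalog keyed by barcode.
    CREATE TABLE my_catalog (sku TEXT, barcode TEXT);
    INSERT INTO my_catalog VALUES
        ('SKU-1', '000000000001'),
        ('SKU-2', '000000000002'),
        ('SKU-3', '000000000009');
""")

# LEFT JOIN keeps every catalog row, so SKUs without a matching barcode
# come back with NULLs and are easy to spot.
ENRICH_SQL = """
    SELECT c.sku, colas.ttb_id, colas.brand_name, colas.product_type
    FROM my_catalog AS c
    LEFT JOIN colas ON colas.barcode = c.barcode
    ORDER BY c.sku
"""

enriched = conn.execute(ENRICH_SQL).fetchall()
for row in enriched:
    print(row)
```

The left join is a deliberate choice here: unmatched SKUs stay visible rather than silently dropping out of the enriched catalog.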
Curious about which organic wines out of Oregon are coming to market in the next month? Or every brand making Milkshake IPAs out of Vermont? COLA Cloud captures label approvals before products hit shelves. Query your way to promising supplier relationships, and stay up to date on the latest trends in any tiny dimension of the market.
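A question like the Oregon organic wine example might translate into a query along these lines. This is only a sketch: `product_type`, `origin_state`, and `approval_date` are assumed column names rather than confirmed fields, the rows are invented, and recent approvals are used as a proxy for products about to reach shelves. SQLite in memory stands in for the real dataset.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Hypothetical columns except ttb_id, brand_name, and front_label_text.
    CREATE TABLE colas (
        ttb_id TEXT, brand_name TEXT, front_label_text TEXT,
        product_type TEXT, origin_state TEXT, approval_date TEXT
    );
    INSERT INTO colas VALUES
        ('T-101', 'Example Vineyard', 'Organic Pinot Noir', 'wine', 'OR', '2024-05-20'),
        ('T-102', 'Sample Estate',    'Pinot Gris',         'wine', 'OR', '2024-05-22'),
        ('T-103', 'Demo Winery',      'Organic Chardonnay', 'wine', 'CA', '2024-05-25'),
        ('T-104', 'Older Label Co',   'Organic Riesling',   'wine', 'OR', '2023-01-10');
""")

# Oregon wines whose front label mentions "organic", approved in the
# 30 days leading up to the :as_of date.
RECENT_ORGANIC_OREGON_WINES = """
    SELECT ttb_id, brand_name, approval_date
    FROM colas
    WHERE product_type = 'wine'
      AND origin_state = 'OR'
      AND LOWER(front_label_text) LIKE '%organic%'
      AND approval_date >= DATE(:as_of, '-30 days')
    ORDER BY approval_date DESC
"""

rows = conn.execute(RECENT_ORGANIC_OREGON_WINES, {"as_of": "2024-06-01"}).fetchall()
for row in rows:
    print(row)
```

Matching on label text is a crude filter; if the dataset exposes a dedicated organic flag, that would be the better predicate.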
Use the COLA Cloud web app to automate high-fidelity insights into the domestic market. Dial in your search parameters, and save the search to revisit whenever you like. For more advanced analysis, get the full dataset in Snowflake and run arbitrary analytical queries.