Show HN: PII Detective, Leveraging LLMs for Cost-Effective PII Detection
6 points by kpolls 8 months ago | 0 comments
PII Detective is a web application designed to identify, classify, and protect Personally Identifiable Information (PII) in data platforms such as BigQuery and Snowflake. It uses LLMs to flag columns whose names suggest PII, and after human-in-the-loop validation it applies Dynamic Data Masking Policies to enforce access controls on those columns while minimizing user friction.
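To make the masking step concrete, here is a rough sketch of the kind of Snowflake statements a dynamic masking policy involves once a column has been confirmed as PII. This is not necessarily the SQL that PII Detective emits; the helper name `masking_policy_sql`, the `mask_<column>` naming scheme, and the `PII_ADMIN` role are all made up for illustration, and the column is assumed to be a STRING.

```python
def masking_policy_sql(table: str, column: str, allowed_roles: list[str]) -> list[str]:
    """Return Snowflake SQL that masks `column` for everyone outside `allowed_roles`.

    Hypothetical helper: identifiers are interpolated without escaping, so it
    assumes trusted table, column, and role names.
    """
    policy = f"mask_{column}"  # assumed naming scheme, not from the project
    roles = ", ".join(f"'{role}'" for role in allowed_roles)
    create = (
        f"CREATE MASKING POLICY IF NOT EXISTS {policy} AS (val STRING) RETURNS STRING ->\n"
        f"  CASE WHEN CURRENT_ROLE() IN ({roles}) THEN val ELSE '***MASKED***' END;"
    )
    apply = f"ALTER TABLE {table} MODIFY COLUMN {column} SET MASKING POLICY {policy};"
    return [create, apply]


for statement in masking_policy_sql("users", "email", ["PII_ADMIN"]):
    print(statement)
```

Because the policy is evaluated at query time, users outside the allowed roles can still query the table and simply see the masked value, which is where the low user friction comes from.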
GCP has a "Sensitive Data Protection" service that promises similar functionality, but it can become extremely costly because it runs hundreds of regex queries over the entire contents of each table. PII Detective, by contrast, uses only table metadata such as table and column names, so you can detect PII in thousands of tables for less than $5 of OpenAI credits!
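A minimal sketch of the metadata-only idea, to show why the cost stays small: the prompt contains only the table name and column names, so its size depends on the number of columns and not on the number of rows. The prompt wording, the function names, and the hard-coded model reply below are all assumptions for illustration, not the project's actual code; a real run would send the prompt to an LLM API instead.

```python
import json


def build_prompt(table: str, columns: list[str]) -> str:
    """Build a classification prompt from metadata only; no row data is read."""
    return (
        "Which of these columns likely contain PII? "
        "Answer with a JSON list of column names.\n"
        f"Table: {table}\n"
        f"Columns: {', '.join(columns)}"
    )


def parse_flagged(reply: str, columns: list[str]) -> list[str]:
    """Keep only names that exist in the table, dropping any the model made up."""
    known = set(columns)
    return [name for name in json.loads(reply) if name in known]


def estimate_cost_usd(prompts: list[str], usd_per_million_tokens: float) -> float:
    """Rough input cost, assuming about 4 characters per token (a rule of thumb)."""
    tokens = sum(len(p) for p in prompts) / 4
    return tokens / 1_000_000 * usd_per_million_tokens


columns = ["id", "email", "created_at", "phone_number"]
prompt = build_prompt("analytics.users", columns)

# Stand-in for a model reply; "ssn" is not a real column and gets filtered out.
reply = '["email", "phone_number", "ssn"]'
print(parse_flagged(reply, columns))  # ['email', 'phone_number']
```

The flagged columns are only candidates; they would still go through the human validation step before any masking policy is applied.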
More info and a demo in the GitHub link. Cheers!