LightBeam Documentation

Looker

Connecting Looker to LightBeam


Overview

LightBeam Spectra users can connect various data sources to the LightBeam application, which then continuously monitors them for PII and PHI data.

Examples: Looker, DynamoDB, and Redshift.

About Looker

Looker is a BI and data visualization platform by Google. It facilitates data connection and visualization across multiple sources. As an intermediary layer over structured data sources like Redshift, BigQuery, and MySQL, it enables the creation of custom data models. Users can run SQL queries on these models, which Looker then translates for the underlying databases. Once integrated with LightBeam, Looker is treated as a structured data source, so the data exposed through its analysis and visualization models can be scanned.

Features

Datasource Registration

A Looker administrator creates a user with limited permissions, and that restricted user's clientID and clientSecret are used for registration. The registration requires the Looker instance's URL, the clientID, and the clientSecret. Users can then select the specific projects, and the models within those projects, that should be scanned.
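As an illustration of how the three registration values fit together, the sketch below builds (but does not send) a login request for the Looker API 4.0 `login` endpoint. Whether LightBeam authenticates in exactly this way is an assumption; the `LookerRegistration` class, the `build_login_request` helper, and the example URL and credentials are hypothetical names introduced only for this example.

```python
from dataclasses import dataclass
from urllib.parse import urlencode
from urllib.request import Request


@dataclass(frozen=True)
class LookerRegistration:
    """The three values the registration form asks for (hypothetical container)."""
    instance_url: str
    client_id: str
    client_secret: str


def build_login_request(reg: LookerRegistration) -> Request:
    """Build, without sending, a POST to the Looker API 4.0 login endpoint."""
    base = reg.instance_url.rstrip("/")  # tolerate a trailing slash in the URL
    body = urlencode(
        {"client_id": reg.client_id, "client_secret": reg.client_secret}
    ).encode("ascii")
    return Request(
        f"{base}/api/4.0/login",
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


# Placeholder values; substitute the restricted user's real credentials.
reg = LookerRegistration("https://looker.example.com/", "my-client-id", "my-client-secret")
req = build_login_request(reg)
print(req.get_method(), req.full_url)
```

Passing the request to `urllib.request.urlopen` would perform the actual login; a successful response from Looker contains an access token for subsequent API calls.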

Metadata Scanning

LightBeam systematically scans models defined in the scanning conditions. Each model is treated as a separate database in our system, named with the structure project_name.model_name. This naming convention incorporates the project name to provide a fully qualified name for each model, allowing users to easily identify the project a model belongs to by its name displayed in the UI.

  • Within each model, LightBeam examines all explores and views that are accessible through these explores.

  • Explores and views are regarded as tables in our system.

  • The naming convention for an explore follows its own name, whereas for a view, it adopts the explore_name.view_name format. This approach ensures that views remain unique, especially since a single view might be associated with multiple explores within the same model.

  • The scanning extends to dimensions within explores and views, which are interpreted as columns in our system, offering a granular look at the data structure.
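The naming rules above can be sketched in a few lines. This is only an illustration of the convention as described here, not LightBeam's implementation; the function names and the sample project, model, explore, and view names are hypothetical.

```python
from typing import Dict, List


def database_name(project: str, model: str) -> str:
    """Each model is shown as a database named project_name.model_name."""
    return f"{project}.{model}"


def table_names(explores: Dict[str, List[str]]) -> List[str]:
    """Map each explore and its accessible views to table names.

    An explore keeps its own name; a view becomes explore_name.view_name,
    so a view shared by several explores still gets a unique table name.
    """
    names: List[str] = []
    for explore, views in explores.items():
        names.append(explore)
        names.extend(f"{explore}.{view}" for view in views)
    return names


# Hypothetical model: the "users" view is reachable from two explores.
explores = {"orders": ["orders", "users"], "customers": ["users"]}

print(database_name("ecommerce", "sales"))
print(table_names(explores))
```

Here the shared `users` view appears twice, as `orders.users` and `customers.users`, which is why the explore name is used as a prefix.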

PII Detection

LightBeam retrieves sample data from each explore or view and classifies every dimension in that explore or view to detect personally identifiable information (PII).
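One way such sample data could be requested is through Looker's inline query endpoint (`POST /api/4.0/queries/run/json`). The sketch below only builds the request body. The sample size, the helper name `build_sample_query`, and the model, explore, and dimension names are assumptions for illustration; how LightBeam actually samples is not documented here.

```python
import json
from typing import Dict, List


def build_sample_query(model: str, explore: str, dimensions: List[str],
                       limit: int = 10) -> Dict[str, object]:
    """Build the JSON body for a Looker inline query that samples rows.

    In Looker query bodies the explore is passed in the "view" field,
    and the row limit is passed as a string.
    """
    return {
        "model": model,
        "view": explore,
        "fields": list(dimensions),  # dimensions in view_name.field_name form
        "limit": str(limit),
    }


# Hypothetical model, explore, and dimensions.
query = build_sample_query("sales", "orders", ["users.email", "orders.id"], limit=5)
print(json.dumps(query, indent=2))
```

The returned rows would then be handed to the classifier, one dimension at a time.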

Limitations

  • Entity and Attribute Instance Creation: Not supported due to Looker API limitations that prevent the joining of different views.

  • Table Size Information: Unavailable as Looker's API does not provide the size details of explores or views.


Onboarding Looker Data Source

  1. Log in to your LightBeam instance.

  2. Click on DATASOURCES on the top navigation bar.

  3. Click on “Add a data source”.

  4. Search for Looker.

  5. Click on Looker.

  6. Fill in the Basic Information details described below and click Next:

    • Instance Name: The unique name given to the data source.

    • Description: An optional field describing the use of this data source.

    • Primary Owner: Email address of the person responsible for this data source, who receives alerts by default.

    • Source of Truth: Marks this data source as a single point of truth. LightBeam Spectra uses such data sources to look up entities and attributes and to check whether the attributes or entities found in other data sources are accurate. A Source of Truth data set creates entities based on the attributes found in its data.

    • Location: The location of the data source.

    • Purpose: The purpose of the data being collected or processed.

    • Stage: The stage of the data source. Example: Source, Processing, Archival, etc.

  7. Enter the Looker credentials (instance URL, clientID, and clientSecret) and click Test Connection.

  8. Verify that the message Connection Success! appears on the screen, then click Next.

  9. In the next step, you will see a list of databases from your Looker instance. Each database corresponds to a Looker model, as described under Metadata Scanning. Fig 6. Looker - Select database

    • Displayed Databases: By default, all databases to which you have access permissions are shown.

    • Custom Selection: To exclude certain databases from scanning, deselect them from the list.

    Verify that every database you intend to scan appears in the list, and make sure your selections are final before connecting the data source.

  10. Finally, click Start Sampling to connect to the Looker data source.


APPENDIX

Minimal permissions setup

To enable LightBeam to scan Looker data, a user with minimal permissions is necessary. Follow these steps to create such a user and generate the required API keys for integration with LightBeam.

  1. Role Creation

    • A role in Looker is defined by a combination of a Model Set and a Permission Set.

    • Navigate to the Admin Panel in Looker.

  2. Model Set Configuration

    • Access Admin → Users → Roles.

    • Click on New Model Set using the blue button at the top of the page.

    • In the new Model Set, select the models you intend to scan with LightBeam.

  3. Permission Set Configuration

    • Still in Admin → Users → Roles, click New Permission Set using the blue button.

    • Configure the new Permission Set with the permissions required for LightBeam scanning.

  4. Finalizing the Role

    • After setting up the Model Set and Permission Set, go to New Role via the blue button.

    • Create a new role that includes the permissions and model set specified in the previous steps.

  5. User Creation and Role Assignment

    • Under Admin → Users, navigate to Users.

    • Select Add Service Accounts with the blue button at the top.

    • Create a new user and assign the newly created role to this account.

  6. API Key Retrieval

    • Go to the newly created user's account page.

    • Click Edit API keys, then copy the clientID and clientSecret.

    • Securely store the copied keys for use while registering Looker with LightBeam.

After completing these steps, you will have a Looker user with minimal permissions that is suitable for integrating with LightBeam and running the necessary scans.
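The guide performs these steps in the Looker Admin UI. For readers who prefer to script them, the sketch below shows roughly how the same sequence maps onto Looker API 4.0 endpoints (`model_sets`, `permission_sets`, `roles`, and `users/{id}/roles`). It only builds the calls and does not send them. The helper names, the object names, the IDs, and in particular the permission name are placeholders: use the permission set shown in the Permission Set Configuration step, which is not reproduced in this text.

```python
from typing import List, Tuple

Call = Tuple[str, str, object]  # (HTTP method, API path, JSON body)


def model_set_call(name: str, models: List[str]) -> Call:
    """Model Set Configuration: the models the role may access."""
    return ("POST", "/api/4.0/model_sets", {"name": name, "models": list(models)})


def permission_set_call(name: str, permissions: List[str]) -> Call:
    """Permission Set Configuration: the permissions granted to the role."""
    return ("POST", "/api/4.0/permission_sets",
            {"name": name, "permissions": list(permissions)})


def role_call(name: str, model_set_id: str, permission_set_id: str) -> Call:
    """Finalizing the Role: combine one Model Set with one Permission Set."""
    return ("POST", "/api/4.0/roles",
            {"name": name, "model_set_id": model_set_id,
             "permission_set_id": permission_set_id})


def assign_role_call(user_id: str, role_id: str) -> Call:
    """Role Assignment: replace the user's roles with the new role only."""
    return ("PUT", f"/api/4.0/users/{user_id}/roles", [role_id])


# Placeholder values; IDs would come from the responses of the earlier calls.
calls = [
    model_set_call("lightbeam_models", ["sales"]),
    permission_set_call("lightbeam_permissions", ["access_data"]),  # placeholder permission
    role_call("lightbeam_scanner", model_set_id="12", permission_set_id="34"),
    assign_role_call(user_id="56", role_id="78"),
]
for method, path, body in calls:
    print(method, path, body)
```

Keeping the role limited to a single Model Set and a single Permission Set mirrors the minimal-permissions intent of this appendix: the scanning user can reach only the models you selected.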

Validate permissions to the database

Next, validate these permissions against the data source. This confirms that the credentials you provide have authorized access to it. Once the permissions are validated, you can onboard Looker in LightBeam.

Steps

First, clone the repository: https://github.com/lightbeamai/lb-installer

  1. Go to the sql_user_check_looker directory.

  2. Refer to the README.md file in that directory for detailed instructions.


About LightBeam

LightBeam automates Privacy, Security, and AI Governance, so businesses can accelerate their growth in new markets. Leveraging generative AI, LightBeam has rapidly gained customers’ trust by pioneering a unique privacy-centric and automation-first approach to security. Unlike siloed solutions, LightBeam ties together sensitive data cataloging, control, and compliance across structured and unstructured data applications, providing 360-visibility, redaction, self-service DSRs, and automated ROPA reporting. This ensures ultimate protection against ransomware and accidental exposures while meeting data privacy obligations efficiently. LightBeam is on a mission to create a secure privacy-first world, helping customers automate compliance against a patchwork of existing and emerging regulations.

For any questions or suggestions, please get in touch with us at: support@lightbeam.ai.

Last updated 1 month ago

Figure 1. Add Data Source
Figure 2. Search Looker
Figure 3. Click on Looker.
Figure 4. Looker - Basic Configuration
Fig 5. Looker - Connection details
Fig 6. Looker - Admin Panel
Fig 7. Looker - Model Set Configuration
Fig 8. Looker - Permission Set Configuration
Fig 9. Looker - Finalizing the Role
Fig 10. Looker - User Creation & Role Assignment
Fig 11. Looker - API Key Retrieval