LightBeam Documentation
Installer GuidesData SourcesPlaybooksInsightsPrivacyOpsGovernance
  • 💡What is LightBeam?
  • 🚀Getting Started
    • ⚙️Installer Guides
      • Pre-Requisites / Security Configurations
        • Firewall Requirements
        • Securing LightBeam on EKS with AWS Certificate Manager on Elastic Load Balancer
        • Configure HTTPS for LightBeam Endpoint FQDN Standalone deployment
        • Using Custom Certificates with LightBeam
        • Securing LightBeam on GKE with Google Certificate Manager and GCE Ingress
      • Core
        • LightBeam Deployment Instructions
        • LightBeam Installer
        • Web App Deployment
        • LightBeam Diagnostics
        • LightBeam Cluster Backup & Restore using Velero
      • Platform Specific
        • AWS
        • Microsoft Azure
        • Google Cloud (GKE)
        • Standalone Virtual Machine
        • Deployment on an Existing Managed Kubernetes Cluster
        • Azure Marketplace Deployment
      • Integration and Setup
        • Setting Up AWS PrivateLink for RDS-EKS Interaction
        • Twingate and LightBeam Integration Guide
        • Data Subject Request Web Application Server
        • Generate CSR for LightBeam
  • 🧠Core Features
    • 🔦Spectra AI
      • 🔗Data Sources
        • Cloud Platforms
          • AWS Auto Discovery
          • GCP Auto Discovery
        • Databases and Datalakes
          • PostgreSQL
          • Aurora (PostgreSQL)
          • Snowflake
          • MS SQL
          • MySQL
          • Aurora (MySQL)
          • BigQuery
          • AWS Redshift
          • Oracle
          • DynamoDB
          • MongoDB
          • CosmosDB (PostgreSQL)
          • CosmosDB (MongoDB)
          • CosmosDB (NoSQL)
          • Looker
          • AWS Glue
          • Databricks
          • SAP HANA
          • CSV Files as a Datasource
        • Messaging
          • Gmail
          • Slack
          • MS Teams
          • MS Outlook
        • Developer Tools
          • Zendesk
          • ServiceNow
          • Jira
          • GitHub
          • Confluence
        • File Repositories
          • NetDocuments
          • AWS S3
          • Azure Blob
          • Google Drive
          • OneDrive
          • SharePoint
          • Viva Engage
          • Dropbox
          • Box
          • SMB
        • CRM
          • Hubspot
          • Salesforce
          • Automated Data Processing (ADP)
          • Marketo
          • Iterable
          • MS Dynamics 365 Sales
          • Salesforce Marketing Cloud
      • 🔔PlayBooks
        • What is LightBeam Playbooks?
        • Policy and Alerts
          • Types of Policies
          • How to create a rule set
            • File Extension Filter
          • Configuring Retention Policies
          • Viewing Alerts
          • Sub Alerts
            • Reassigning Sub-Alerts
            • Sub-alert States
          • Levels of Actions on Alerts
          • User Roles and Permissions
            • Admin View
            • Alert Owner View
            • Onboarding New Users
              • User Management
              • Okta Integration
              • Alert Assignment Settings
              • Email Notifications
            • Planned Enhancements
          • Audit Logs
          • No Scan List
          • Permit List
          • Policy in read-only mode
      • 📊Insights
        • Entity Workflow
        • Document Classification
        • Attribute Management Overview
          • Attributes Page View
          • Attribute Sets
          • Creating Custom Attribute
          • Attributes List
        • Template Builder
        • Label Management
          • MIP Integration
          • Google Labels Integration
      • 🗃️Reporting
        • Delta Reporting
        • Executive Report
        • LightBeam Lens
      • Scanning and Redaction of Files
        • On-demand scanning
      • How-to Guides
        • Leveraging LightBeam insights for structured data sources
      • LightBeam Dashboard Outlay
      • Risk Score
    • 🏛️PrivacyOps
      • Data Subject Request (DSR)
        • What is DSR?
        • Accessing the DSR Module
        • DSR Form Builder (DPO View)
          • Creating a New DSR Form
            • Using a Predefined Template
            • Creating a Custom Form
          • Form Configuration
          • Form Preview and Publishing
          • Multi-Form Management
          • Messaging Templates
        • Form Submission & Email Verification (Data Subject View)
        • DSR Management Dashboard (DPO View)
        • Processing DSR Requests
          • Data Protection Officer (DPO) Workflow
          • Self Service Workflow (Direct Validation)
          • Data Source Owner (DSO) Workflow
        • DSR Report
      • 🚧Consent Management
        • Overview
        • Consent Logs
        • Preference Centre
        • Settings
      • 🍪Cookie Consent
        • Dashboard
        • Banners
        • Domains
        • Settings
        • CMP Deployment Guide for Google Tag Manager
        • FAQs
      • 🔏Privacy Impact Assessment (PIA)
        • PIA Templates
        • PIA Assessment Workflow
        • Collaborator View
        • Process Owner Login View (With Collaborator)
        • Filling questionnaire without collaborator
        • Submitting the assessment for DPO review
        • DPO review process
        • Marking the assessment as reviewed
        • Editing and resubmitting assessments after DPO review
        • Revoke review request
        • Edit Reviewer
        • PIA Reports
      • ⏺️Records of Processing Activity (RoPA)
        • Creating a RoPA Template
          • How to clone a template
          • How to use a template
        • How to create a process
          • Adding Process Details
          • Adding Data Elements
          • Adding Data Subjects
          • Adding Data Retention
          • Adding Safeguards
          • Adding Transfers
          • Adding a Custom Section
          • Setting a Review Schedule
          • Data Flow Diagram
        • How to add a collaborator
        • Overview Section
        • Generating a RoPA Report Using LightBeam
        • Collaborator working on a ticket
    • 🛡️Governance
      • Access
        • Dashboard
        • Users
        • Groups
        • Objects
        • Active Directory Settings
        • Access Governance at a Data Source Level
        • Policies and Alerting
        • Access Governance Statistics
        • Governance Module Dashboard
      • Privacy At Partners
  • 📊Tools & Resources
    • 🔀API Documentation
      • API to Create Reports for Structured Datasource
    • ❓Onboarding Assessments
      • Structured Datasource Onboarding Questionnaire
        • MongoDB/CosmosDB Questionnaire
        • Oracle Datasource Questionnaire
      • SMB Questionnaire
    • 🛠️Administration
      • Audit Logs
      • SMTP
        • Basic and oAuth Configuration
      • User Management
        • SAML Identity Providers
          • Okta
            • LightBeam Okta SAML Configuration Guide
          • Azure
            • Azure AD SAML Configuration for LightBeam
          • Google
            • Google IDP
        • Local User Management
          • Adding a User to the LightBeam Dashboard
          • Reset Default Admin Password
  • 📚Support & Reference
    • 📅Release Notes
      • LightBeam v2.2.0
      • Reporting Release Notes
      • Q1 2024 Key Enhancements
      • Q2 2024 Key Enhancements
      • Q3 2024 Key Enhancements
      • Q4 2024 Key Enhancements
    • 📖Glossary
Powered by GitBook
On this page
  • Overview
  • Connecting Google Drive Data Source
  • Appendix
  • Create Service Account
  • Onboarding Google Drive Datasource
  • About LightBeam
  1. Core Features
  2. Spectra AI
  3. Data Sources
  4. File Repositories

Google Drive

Connecting Google Drive to LightBeam

PreviousAzure BlobNextOneDrive

Last updated 1 month ago


Overview

LightBeam Spectra users can connect various data sources to the LightBeam application and these data sources will be continuously monitored for PII, PHI data.

Example: Google Drive, OneDrive, SharePoint, etc


Connecting Google Drive Data Source

  1. Login to your LightBeam Instance.

  2. Click on DATASOURCES on the Top Navigation Bar.

  3. Click on “Add a data source”.

  1. Search for “Google Drive”.

  1. Click on Google Drive.

  1. Fill in the requested information and click on Next.

Basic Information

  1. Data Source Name: This is the unique name given to the data source.

  2. Description: This is an optional field needed to describe the use of this data source.

  3. Primary Owner: Email address of the person responsible for this data source which will get alerts by default.

  4. Entity Creation: LightBeam Spectra detects and associates attributes based on the context and identifies whose data it is; these are called entities. Example: Jane Doe is an entity for whom LightBeam Spectra might have detected Name and SSN in a monitored data source.

  5. Source of Truth: LightBeam Spectra would have monitored data sources that contain data acting as a single point of truth and that can be used for looking up entities/attributes which help to identify if the other attributes/entities found in any other data source are accurate or not. A Source of Truth data set would create entities based on the attributes found in the data.

  6. Location: The location of the data source.

  7. Purpose: The purpose of the data being collected/processed.

  8. Stage: The stage of the data source. Example: Source, Processing, Archival, etc.

  1. Provide the credentials as shown below and click on Test Connection.

  1. Verify that you get the message Connection Success! on the screen. Click on Next.

  2. In this step, you can choose either of three scan setting options –

i) Scan all Drives

ii) Scan selected Drives

iii) Scan folder

To choose option (i), select Scan all Drives, and click on Save.

To choose option (ii), select Scan selected drives. Now enter the names of the drives that you would like to include for scanning in the Search box individually.

Select the drives by ticking the checkboxes next to them.

Click on Save.

To choose option (iii), select Scan folder.

This is a two-fold process:

  • To add the drive containing the folder you want to scan, first follow the instructions in option (ii).

  • Now enter the ID of the folder that you would like to include for scanning in the Search box. ID can be found the folder URL. Example: https://drive.google.com/drive/folders/<Folder_ID>

Click on Save.

Now we are ready to browse through onboarded Google Drive data source dashboard.


Appendix

Service account json creation for Google Drive

This document describes the steps to generate the service account json required to connect to and call Google APIs to access various Google services.

Success Tip: Depending on the services you plan to use, choose the corresponding subscription plan for your GSuite account. You must select the GSuite Business subscription if you want a use case to access Audit API reports, let users create shared drives, etc.

Create Service Account

Create Project and Enable API Services

Make sure the following APIs are enabled:

  • Google Drive API

  • Admin SDK API

  • Gmail API

  • Drive Labels API

Create Application within Project

  1. Click on CONFIGURE CONSENT SCREEN which is shown at the top as a warning. Choose User Type as Internal and click on CREATE.

  • Give a name to your application. e.g. demo-application.

  • Choose/Write the logged-in admin user’s email as the value for the mandatory fields of User support email and Developer contact information.

  • Click on SAVE AND CONTINUE.

  • Skip the next screens and come back to the Credentials page.

  1. Create a service account. Click on the CREATE CREDENTIALS > Service account.

  • Give a name and description and click on CREATE.

  • Select a role for this service account. You might see an option labeled 'Currently used', and the role value could be 'Owner'. This is because we've logged in using an admin account and created the project. Click on CONTINUE.

  • You can also add more users (apart from the logged-in admin user) to the service account. This step is optional.

  • Click on DONE.

Create Service Account

  1. You will now be redirected to the credentials page and observe that a service account is created. Click to edit the created service account.

  • In the pre-selected DETAILS tab, click on the advanced settings drop-down

CREATE GOOGLE WORKSPACE MARKETPLACE- COMPATIBLE OAUTH CLIENT

  • There is another section called Keys.

Click on ADD KEY → Create new key → Choose Key type as JSON → CREATE.

A service account JSON will be downloaded.

Note: Make sure you do not lose this key and that you keep it private and secure.

This file will be used as <inputfile> in the last command mentioned in the document.

You will see that an Oauth 2.0 Client ID is created as a result of the previous step of creating a service account.

Add Permissions and Scopes to the Client ID

Sign in with an administrator account.

click on Main Menu (This is located at the top left section of the page and is a hamburger menu option) → Security → Access and Data control → API Controls.

Click on Manage domain-wide delegation.

  • You will see a screen where all the clients for your account are listed. You will need to add scopes for the newly created client whose Client Id has already been copied to your clipboard (see annotated text above in Create Service Account). Here, click on Add new.

  • Paste the Client Id in the corresponding placeholder.

  • Enter the below OAuth scopes in a comma-separated fashion in the second field:

OAuth scopes

https://www.googleapis.com/auth/admin.directory.group.readonly,

https://www.googleapis.com/auth/drive.readonly,

https://www.googleapis.com/auth/admin.reports.audit.readonly,

https://www.googleapis.com/auth/admin.directory.user.readonly,

https://www.googleapis.com/auth/admin.directory.group.member.readonly

https://www.googleapis.com/auth/gmail.readonly,

https://www.googleapis.com/auth/drive,

https://www.googleapis.com/auth/drive.file,

https://www.googleapis.com/auth/drive.metadata,

https://www.googleapis.com/auth/drive.labels,

https://www.googleapis.com/auth/drive.labels.readonly,

https://www.googleapis.com/auth/drive.admin.labels,

https://www.googleapis.com/auth/drive.admin.labels.readonly

  • Click on AUTHORIZE.

Then, select the configured client and click on “view details” to make sure all OAuth scopes are configured correctly, it should look like:

Configured OAuth Scopes

Fetch the Auth Values to connect to LightBeam

base64 -i inputfile -o outfile

OR

openssl base64 -A -in <inputfile> -out <outfile>

Save the outfile which will be needed while onboarding the datasource.

After the generation of the service account JSON, one needs to use the same while onboarding the Google Drive datasource into LightBeam. We explain the steps for this in the following section.

Onboarding Google Drive Datasource

To onboard the Google Drive datasource in LightBeam we need the following:

  1. Delegated credentials.

The following explains the meaning of delegate credentials and their use with LightBeam:

Delegated credentials: This field is the email address of the user who is the Google account admin for the organization. In most cases, this email id is the same as that of the user who helped generate the service account credentials. If not the admin, the email id must be of a user who at minimum has permission for Groups and Services in the Admin portal. Attached below is a screenshot of how the config for this kind of user would look in the Admin portal.

How does LightBeam use the Delegated credentials? For accessing the Gmail data of users in the organization a service account is created, and it needs to be given domain-wide delegation. Domain-wide delegation allows a service account to access user data on behalf of any user in a Google Apps domain without requiring consent from every user. This email represents the user on behalf of whom the service account would be accessing various google api calls like listing users, drives etc.


About LightBeam

LightBeam automates Privacy, Security, and AI Governance, so businesses can accelerate their growth in new markets. Leveraging generative AI, LightBeam has rapidly gained customers’ trust by pioneering a unique privacy-centric and automation-first approach to security. Unlike siloed solutions, LightBeam ties together sensitive data cataloging, control, and compliance across structured and unstructured data applications providing 360-visibility, redaction, self-service DSRs, and automated ROPA reporting ensuring ultimate protection against ransomware and accidental exposures while meeting data privacy obligations efficiently. LightBeam is on a mission to create a secure privacy-first world helping customers automate compliance against a patchwork of existing and emerging regulations.

Figure 2.1 Google Drive
Figure 4. LightBeam Google Drive - Test Connection
Figure 5.1 LightBeam Google Drive - Scan Settings
Figure 5.2 LightBeam Google Drive - Scan Settings
Figure 5.3 LightBeam Google Drive - Scan Settings

Note: To get the Google Drive data source details please check .

As a Google Admin User: Create a project if the one needed does not exist already:

Select the newly created project and click on ENABLE APIS AND SERVICES on this link .

Generate credentials on

Figure.6 Create Google Workspace Marketplace-Compatible Oauth Client

Go to the credential page URL

Figure.7 Google APIs - Credentials Page

To sign in to , use an administrator account for a managed Google service, such as Google Workspace or Cloud Identity.

On the page with the same logged-in admin user,

Figure.8 API Controls

,

,

,

,

,

,

,

,

,

,

,

Above is Including scope for .

Base64 encoded value of the service account json created (using the steps listed )

Figure.10 LightBeam Google Drive - Connection Details
Figure 11. Google Admin Configuration

For any questions or suggestions, please get in touch with us at: .

🧠
🔦
🔗
Appendix
https://console.developers.google.com/projectcreate?
https://console.developers.google.com/apis/dashboard
https://console.developers.google.com/apis/credentials
https://console.cloud.google.com/apis/credentials
admin.google.com
https://admin.google.com/
https://www.googleapis.com/auth/admin.directory.group.readonly
https://www.googleapis.com/auth/drive.readonly
https://www.googleapis.com/auth/admin.reports.audit.readonly
https://www.googleapis.com/auth/admin.directory.user.readonly
https://www.googleapis.com/auth/gmail.readonly
https://www.googleapis.com/auth/drive
https://www.googleapis.com/auth/drive.file
https://www.googleapis.com/auth/drive.metadata
https://www.googleapis.com/auth/drive.labels
https://www.googleapis.com/auth/drive.labels.readonly
https://www.googleapis.com/auth/drive.admin.labels
https://www.googleapis.com/auth/drive.admin.labels.readonly
support@lightbeam.ai
here
Google labels integration
Figure 1. Add Data Source
Figure 2. Search for Google Drive
Figure 3. LightBeam Google Drive - Basic Information
Figure.9 OAuth Scopes Configuration