LightBeam Documentation

Azure Blob

Connecting Azure Blob to LightBeam


Last updated 10 months ago


Overview

LightBeam Spectra users can connect various data sources to the LightBeam application. Once connected, these data sources are continuously monitored for PII and PHI data.

Example: Azure Blob Storage, AWS S3, Google Drive, OneDrive, etc.


Steps to generate Azure Blob Storage credentials

  1. Log in to https://azure.microsoft.com/en-gb/

  2. Click on Portal.

  3. Click on the Search box on the top navigation bar. Type and search “App Registrations”.

  4. Click on App Registrations.

  5. Click on New Registration. Add details as shown below and click Register.

  6. Click on Certificates and secrets.

  7. Click on New client secret.

  8. Fill in the Description and Expires fields for the client secret.

  9. Click on Add.

  10. Copy the Client Secret value and keep it secure for future use, as you will not be able to retrieve it later.

Example: 0d67021d-376a-4c64-9f03-4b69e9716076

  11. Configure API Permissions.

Click API permissions -> Add a permission -> Azure Storage.

Then add permission for Azure Data Explorer.

Similarly, add permission for Azure Service Management.

Once the permissions are added, the application is ready to use.

Click on Overview and copy the Application (client) ID and Directory (tenant) ID.

We now have all the configuration parameters required to onboard the Azure Blob data source: Client ID, Client Secret value, and Tenant ID. Before onboarding, the application must first be added to the IAM policy of the containers that need to be synced with LightBeam.
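To illustrate how the Client ID, Client Secret value, and Tenant ID fit together, the sketch below builds (but does not send) an OAuth 2.0 client-credentials token request against the Microsoft identity platform. The function name `build_token_request` and the placeholder values are assumptions made for this example; this page does not describe how LightBeam itself authenticates, so treat this only as a way to sanity-check the values you copied.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Scope used for Azure Storage with the client-credentials flow.
STORAGE_SCOPE = "https://storage.azure.com/.default"


def build_token_request(tenant_id: str, client_id: str, client_secret: str) -> Request:
    """Build (but do not send) a token request for the registered application.

    Hypothetical helper for illustration; not part of LightBeam.
    """
    url = f"https://login.microsoftonline.com/{tenant_id}/oauth2/v2.0/token"
    body = urlencode(
        {
            "grant_type": "client_credentials",
            "client_id": client_id,
            "client_secret": client_secret,
            "scope": STORAGE_SCOPE,
        }
    ).encode("ascii")
    return Request(
        url,
        data=body,
        headers={"Content-Type": "application/x-www-form-urlencoded"},
        method="POST",
    )


if __name__ == "__main__":
    # Placeholder values; substitute the ones copied from the Azure portal.
    request = build_token_request(
        tenant_id="00000000-0000-0000-0000-000000000000",
        client_id="11111111-1111-1111-1111-111111111111",
        client_secret="<client-secret-value>",
    )
    print(request.get_method(), request.full_url)
```

Sending the request (for example with `urllib.request.urlopen`) should return a JSON document containing an `access_token` if the three values are correct. The token only grants access to storage data after the IAM roles described in the next section have been assigned.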

Add IAM access to containers for the above application

To give the above application access to Azure Blob Storage containers, the application must be allowed in the containers' IAM policy.


Example: allowing all the containers of an Azure storage account to sync

  1. Open the Azure Storage account on the Azure portal.

  2. Open Access Control (IAM) from the left sidebar and select Role assignments.

  3. Select Add -> Add role assignment and select the following roles:

    1. Reader

    2. Storage Blob Data Reader

    3. Storage Queue Data Contributor

    4. Storage Account Contributor

    Note: If multi-select does not work, add the roles one at a time.

  4. Open the Azure subscription that is the parent of the above storage account and similarly add

    1. EventGrid Contributor

  5. Click on Next

    1. Assign access to: User, group, or service principal

    2. Click on Select members

    3. Search for the name of the application created above. (Azure does not list applications by default, so the application only appears after you search for it.)

    4. After selecting the above app, click on Review + assign

    5. On successful assignment of the permissions, the application credentials are ready to sync the containers present in this storage account with LightBeam.

      Note: It may take up to 10 minutes for the permissions to take effect.

Note: The same process can be done at different levels, such as subscription, resource group, or individual container.
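For teams that prefer scripting over the portal, the role assignments above can also be expressed as Azure CLI commands. The sketch below only generates the command strings; it does not run them. The helper name `role_assignment_commands` and the example IDs are assumptions for illustration, and the scopes mirror the example above (storage account for the first four roles, parent subscription for EventGrid Contributor).

```python
import shlex

# Roles named in the steps above, grouped by the scope they are assigned at.
STORAGE_ACCOUNT_ROLES = [
    "Reader",
    "Storage Blob Data Reader",
    "Storage Queue Data Contributor",
    "Storage Account Contributor",
]
SUBSCRIPTION_ROLES = ["EventGrid Contributor"]


def role_assignment_commands(app_id, subscription_id, resource_group, storage_account):
    """Return `az role assignment create` commands for the registered app.

    Hypothetical helper: it builds command strings only and runs nothing.
    """
    subscription_scope = f"/subscriptions/{subscription_id}"
    account_scope = (
        f"{subscription_scope}/resourceGroups/{resource_group}"
        f"/providers/Microsoft.Storage/storageAccounts/{storage_account}"
    )
    commands = []
    for scope, roles in (
        (account_scope, STORAGE_ACCOUNT_ROLES),
        (subscription_scope, SUBSCRIPTION_ROLES),
    ):
        for role in roles:
            commands.append(
                shlex.join(
                    [
                        "az", "role", "assignment", "create",
                        "--assignee", app_id,
                        "--role", role,
                        "--scope", scope,
                    ]
                )
            )
    return commands


if __name__ == "__main__":
    # Placeholder identifiers; replace with your own values.
    for command in role_assignment_commands(
        app_id="11111111-1111-1111-1111-111111111111",
        subscription_id="22222222-2222-2222-2222-222222222222",
        resource_group="example-rg",
        storage_account="examplestorage",
    ):
        print(command)
```

To grant access at a different level, as the note above allows, change the scope string to the resource group or subscription instead of the storage account.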


Connecting Azure Blob Storage Data Source

  1. Log in to your LightBeam instance.

  2. Click on DATASOURCES on the Top Navigation Bar.

  3. Click on “Add a data source”.

  4. Search for “Azure Blob Storage”.

  5. Click on Azure Blob Storage.

  6. Fill in the requested information and click on Next.

Basic Information

  1. Data Source Name: This is the unique name given to the data source.

  2. Description: An optional field used to describe the purpose of this data source.

  3. Primary Owner: Email address of the person responsible for this data source, who will receive alerts by default.

  4. Entity Creation: LightBeam Spectra detects and associates attributes based on the context and identifies whose data it is; these are called entities. Example: Jane Doe is an entity for whom LightBeam Spectra might have detected Name and SSN in a monitored data source.

  5. Source of Truth: LightBeam Spectra includes monitored data sources that serve as a single point of truth. These sources are utilized for looking up entities/attributes to verify the accuracy of attributes/entities discovered in other data sources. By using a Source of Truth dataset, entities are formulated based on the attributes present in the data.

  6. Location: The location of the data source.

  7. Purpose: The purpose of the data being collected/processed.

  8. Stage: The stage of the data source. Example: Source, Processing, Archival, etc.

Datasource Configuration

7. Please provide the credentials below and hit Test Connection.

LightBeam uses the Live Scan approach, which tracks changes made to objects in containers and makes use of Azure Event Grid to provide real-time updates of these changes.

Each container's storage account must have the Event Grid service enabled for this to work. If it isn't already enabled, LightBeam will enable it automatically.

Please ensure that the credentials you provide have the permissions required to do this.
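As a rough picture of what an Event Grid change notification carries, the sketch below extracts the container and blob name from a Blob Storage event in the Event Grid event schema. The `eventType` and `subject` fields follow Azure's published schema; the `parse_blob_event` helper and the sample event are assumptions for illustration and do not represent LightBeam's internal implementation.

```python
from typing import NamedTuple, Optional

# Blob change events this sketch cares about.
BLOB_EVENT_TYPES = {
    "Microsoft.Storage.BlobCreated",
    "Microsoft.Storage.BlobDeleted",
}
SUBJECT_PREFIX = "/blobServices/default/containers/"


class BlobChange(NamedTuple):
    event_type: str
    container: str
    blob_name: str


def parse_blob_event(event: dict) -> Optional[BlobChange]:
    """Return the changed container/blob, or None for unrelated events.

    Hypothetical helper for illustration only.
    """
    event_type = event.get("eventType", "")
    subject = event.get("subject", "")
    if event_type not in BLOB_EVENT_TYPES or not subject.startswith(SUBJECT_PREFIX):
        return None
    # Subject format: /blobServices/default/containers/<container>/blobs/<blob>
    container, separator, blob_name = subject[len(SUBJECT_PREFIX):].partition("/blobs/")
    if not separator or not container or not blob_name:
        return None
    return BlobChange(event_type, container, blob_name)


if __name__ == "__main__":
    sample_event = {
        "eventType": "Microsoft.Storage.BlobCreated",
        "subject": "/blobServices/default/containers/reports/blobs/2024/q1.csv",
    }
    print(parse_blob_event(sample_event))
```

Because container names cannot contain a slash, splitting on the first `/blobs/` keeps any folder-like prefixes as part of the blob name.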

  • Client Id: It refers to the unique identifier assigned to the Azure portal application that is used for integrating LightBeam with the Blob Storage data source. It is generated when you register an application in the Azure portal.

  • Client Secret value: It is a confidential key or password associated with the Azure portal application. It is used to authenticate and authorize the application when accessing Blob Storage resources. The Client Secret value is generated when you create a new client secret in the Azure portal.

  • Tenant Id: It is a unique identifier assigned to the Azure Active Directory (AAD) tenant associated with the organization. It represents the organization's directory or identity store in Azure AD. The Tenant Id is obtained from the Azure portal.
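Before clicking Test Connection, it can help to check that the values were pasted into the right fields. The sketch below is a minimal check under one assumption: the Client Id and Tenant Id shown on the app's Overview page are GUIDs, while the Client Secret value is an opaque, non-empty string. The `AzureBlobCredentials` class is a hypothetical name used only for this example.

```python
import uuid
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class AzureBlobCredentials:
    """Hypothetical container for the three values requested by the form."""

    client_id: str
    client_secret: str
    tenant_id: str

    def problems(self) -> List[str]:
        """Return human-readable problems; an empty list means the values look plausible."""
        issues = []
        for label, value in (("Client Id", self.client_id), ("Tenant Id", self.tenant_id)):
            try:
                uuid.UUID(value)
            except ValueError:
                issues.append(f"{label} is not a valid GUID")
        if not self.client_secret.strip():
            issues.append("Client Secret value is empty")
        return issues


if __name__ == "__main__":
    # Placeholder values; substitute the ones copied from the Azure portal.
    credentials = AzureBlobCredentials(
        client_id="11111111-1111-1111-1111-111111111111",
        client_secret="<client-secret-value>",
        tenant_id="00000000-0000-0000-0000-000000000000",
    )
    print(credentials.problems())
```

This check is purely syntactic. It cannot tell whether the secret has expired or whether the roles have been assigned; only Test Connection confirms that.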

8. Verify that you get the message Connection Success! on the screen. Click on Next.

9. In this step, you can choose one of the following scan setting options:

i) Scan all containers

ii) Scan selected containers

iii) Scan selected folders

To choose option (i), select Scan all Containers, and click on Validate And Save.

This will allow for the registration of the Azure Blob Storage containers.

To choose option (ii), select Scan selected Containers. Now enter the names of the containers that you would like to scan in the Search box individually. Select the containers by ticking the checkboxes next to them.

10. Once the required containers are selected, click on Save.

Now that the Azure Blob Storage datasource is connected to LightBeam, we can begin viewing the dashboard and other pages of the onboarded datasource.


About LightBeam

LightBeam automates Privacy, Security, and AI Governance, so businesses can accelerate their growth in new markets. Leveraging generative AI, LightBeam has rapidly gained customers’ trust by pioneering a unique privacy-centric and automation-first approach to security. Unlike siloed solutions, LightBeam ties together sensitive data cataloging, control, and compliance across structured and unstructured data applications providing 360-visibility, redaction, self-service DSRs, and automated ROPA reporting ensuring ultimate protection against ransomware and accidental exposures while meeting data privacy obligations efficiently. LightBeam is on a mission to create a secure privacy-first world helping customers automate compliance against a patchwork of existing and emerging regulations.

Please check https://learn.microsoft.com/en-us/azure/role-based-access-control/role-assignments-portal?tabs=delegate-condition for details on assigning Azure roles in the portal.

Figure 10. Add Data Source

Note: To get the Azure Blob Storage connection details, please check the Appendix.

For any questions or suggestions, please get in touch with us at: support@lightbeam.ai.

https://azure.microsoft.com/en-gb/
Figure 1. Microsoft Azure Portal
Figure 2. Click on App Registrations
Figure 3. Select App Registration details
Figure 4. Add client secret value
Figure 5. Client secret value
Figure 6. Add Azure Storage access to the App
Figure 7. Add Azure Data Explorer access to the App
Figure 8. Add Azure Service Management access to the App
Figure 9. Registered Application overview
Figure 11. Find Datasource
Figure 12. LightBeam Azure Blob Storage - Basic Information
Figure 13.1 (a). Scan all containers - Registration of Azure Blob Storage
Figure 13.1 (b). Select only specific containers