Skip to main content

Exchange Mailbox

Use the Exchange Mailbox source to enable the crawling and classification of content stored in a single Exchange mailbox on the on-premises Exchange server or Exchange Online.

Step 1 – In Netwrix Data Classification management console, open the Sources view and click Add.

Step 2 – Select Exchange Mailbox source type and in the properties window specify the necessary settings.

Authentication type: Modern authentication

If you plan to use this authentication type (available only for Exchange Online mailbox processing), specify the following:

OptionDescription
Authentication typeSelect Modern (Exchange Online)
Admin UsernameSpecify the administrative account for the required Exchange Online organization. The user must have a mailbox connected to it in order to crawl Exchange.
Tenant IDEnter the Tenant ID you obtained at Step 5: Obtain Tenant ID.
Certificate thumbprintEnter the certificate thumbprint you prepared at Step 4: Configure Certificates & secrets.
Application IDEnter the app ID you got at application registration at Step 2: Create and Register a new app in Azure AD (it can be found in the Azure AD app properties >Overview).

exchangeonline_cfg_modern_auth_thumb_0_0

Authentication type: Basic

If you plan to use this authentication type, you will need to specify the following:

OptionDescriptionComments
Email Address / PasswordAdministrator account that has been assigned both: 1. Impersonation right 2. Discovery Management roleSee Configure Microsoft Exchange for Crawling and Classification for details on the rights assignment.

Other configuration settings

By default, only basic settings are displayed. To view advanced options, click the "wrench" icon at Settings in the bottom.

OptionDescriptionComments
Basic settings
MailboxMailbox to be crawled.When using impersonation, the settings can be like the following example: - Email Address - administrative account granted Impersonation right, e.g. administrator@cs.com - Mailbox - target mailbox, e.g. test@cs.com.
Crawl RangeDefine what portions of data should be retrieved from the Exchange server: - Select Date Range to crawl a static set of data within the required interval. - Select Since if you want to periodically re-crawl content from the specified date, taking into account the last crawl date for each object.
Crawl In-Place ArchiveSelect this option if you want to crawl Exchange Online in-place archive mailboxes.Applies to Exchange Online.
OCR Processing ModeSet the processing mode for document images: - Disabled - document images will not be processed - Default - defaults to the source setting (if configuring a path) or the global setting (if configured on a source) - Normal - process the images with normal quality settings - Enhanced - upscale the images further to allow more accurate results.The Enhanced mode will provide better accuracy but can lead to longer processing time if the images do not contain text.
Source GroupSelect the source group (if any).
Pause source on creationSelect if you want to make other configuration changes before data collection occurs.
Advanced settings
Build Search IndexSelect if you want search index to be created.
Re-Index PeriodSpecify how often the source should be checked for changes. Default is 7 days.Netwrix recommends using default values.
PrioritySet priority for this data source to be crawled. Select the priority level from the list values: - Highest - High - Normal - Low - Lowest
Document TypeSpecify a value which can be used to restrict queries when utilizing the Netwrix Data Classification search index.

Exchange Mailbox

For the Exchange Mailbox source, you can configure the list of folders/emails to exclude from processing. Do the following:

  1. In the management console, click SourcesExchange Mailbox, then Collection Exclusion will be displayed.

  2. To create an exclusion, click Add.

  3. boxexclusions

  4. In the Details window, on the Filter tab enter the name of the entity to exclude. Consider the following:

    • If you specify a folder name (e.g. “Drafts”) with no special characters, then any folders with that specific name will be excluded. Note that adding an exclusion of this type will match any folders with the name provided, wherever they are within the mailbox.
    • Wrapping the exclusion in wildcard indicators (e.g. “*Deleted*”) will match any folder/email with “Deleted” somewhere in the title.

    NOTE: You can optionally enter exclusion location in the Test Path field to verify the new filter, and click Test.

  5. If needed, you can use metadata conditions to restrict when an exclusion filter should be applied. For that, click Condition tab and click Add. Then select how the exclusion conditions will work: it can check if metadata field of the document has any value, is not specified, or matches a specific metadata value.

    CriteriaCondition
    ComparisonCompare a value in the document metadata field with the value set by condition. With this criteria selected, you will then need to specify: - Field name — document metadata field to check - Comparison — operator to use (for example, "does not contain") - Value — value to compare against For example, to exclude documents tagged with year 2018, set the condition as follows: - Field NameDocYear - Comparisonequals - Value2018
    Has any valueExclude the document if its metadata field has any value. With this criteria selected, specify Field Name.
    Has no valuesExclude the document if metadata field value is not specified. With this criteria selected, specify Field Name.

    When finished, click Add.

  6. Finally, click Save and close the window.