Skip to main content

Exchange Mailbox

For the Exchange Mailbox source, you can configure the list of folders/emails to exclude from processing. Do the following:

  1. In the management console, click SourcesExchange Mailbox, then Collection Exclusion will be displayed.

  2. To create an exclusion, click Add.

  3. boxexclusions

  4. In the Details window, on the Filter tab enter the name of the entity to exclude. Consider the following:

    • If you specify a folder name (e.g. “Drafts”) with no special characters, then any folders with that specific name will be excluded. Note that adding an exclusion of this type will match any folders with the name provided, wherever they are within the mailbox.
    • Wrapping the exclusion in wildcard indicators (e.g. “*Deleted*”) will match any folder/email with “Deleted” somewhere in the title.

    NOTE: You can optionally enter exclusion location in the Test Path field to verify the new filter, and click Test.

  5. If needed, you can use metadata conditions to restrict when an exclusion filter should be applied. For that, click Condition tab and click Add. Then select how the exclusion conditions will work: it can check if metadata field of the document has any value, is not specified, or matches a specific metadata value.

    CriteriaCondition
    ComparisonCompare a value in the document metadata field with the value set by condition. With this criteria selected, you will then need to specify: - Field name — document metadata field to check - Comparison — operator to use (for example, "does not contain") - Value — value to compare against For example, to exclude documents tagged with year 2018, set the condition as follows: - Field NameDocYear - Comparisonequals - Value2018
    Has any valueExclude the document if its metadata field has any value. With this criteria selected, specify Field Name.
    Has no valuesExclude the document if metadata field value is not specified. With this criteria selected, specify Field Name.

    When finished, click Add.

  6. Finally, click Save and close the window.

Exchange Mailbox

Use the Exchange Mailbox source to enable the crawling and classification of content stored in a single Exchange mailbox on the on-premises Exchange server or Exchange Online.

  1. In Netwrix Data Classification management console, open the Sources view and click Add.
  2. Select Exchange Mailbox source type and in the properties window specify the necessary settings.

Authentication type: Modern authentication

If you plan to use this authentication type (available only for Exchange Online mailbox processing), specify the following:

OptionDescription
Authentication typeSelect Modern (Exchange Online)
Admin UsernameSpecify the administrative account for the required Exchange Online organization.
Tenant IDEnter the Tenant ID you obtained at Step 5: Obtain Tenant ID.
Certificate thumbprintEnter the certificate thumbprint you prepared at Step 4: Configure Certificates & secrets.
Application IDEnter the app ID you got at application registration at Step 2: Create and Register a new app in Azure AD (it can be found in the Azure AD app properties >Overview).

exchangeonline_cfg_modern_auth_thumb_0_0

Authentication type: Basic

If you plan to use this authentication type, you will need to specify the following:

OptionDescriptionComments
Email Address / PasswordAdministrator account that has been assigned both: 1. Impersonation right 2. Discovery Management roleSee Configure Microsoft Exchange for Crawling and Classification for details on the rights assignment.

Other configuration settings

By default, only basic settings are displayed. To view advanced options, click the "wrench" icon at Settings in the bottom.

OptionDescriptionComments
Basic settings
MailboxMailbox to be crawled.When using impersonation, the settings can be like the following example: - Email Address - administrative account granted Impersonation right, e.g. administrator@cs.com - Mailbox - target mailbox, e.g. test@cs.com.
Crawl RangeDefine what portions of data should be retrieved from the Exchange server: - Select Date Range to crawl a static set of data within the required interval. - Select Since if you want to periodically re-crawl content from the specified date, taking into account the last crawl date for each object.
Crawl In-Place ArchiveSelect this option if you want to crawl Exchange Online in-place archive mailboxes.Applies to Exchange Online.
OCR Processing ModeSet the processing mode for document images: - Disabled - document images will not be processed - Default - defaults to the source setting (if configuring a path) or the global setting (if configured on a source) - Normal - process the images with normal quality settings - Enhanced - upscale the images further to allow more accurate results.The Enhanced mode will provide better accuracy but can lead to longer processing time if the images do not contain text.
Source GroupSelect the source group (if any).
Pause source on creationSelect if you want to make other configuration changes before data collection occurs.
Advanced settings
Build Search IndexSelect if you want search index to be created.
Re-Index PeriodSpecify how often the source should be checked for changes. Default is 7 days.Netwrix recommends using default values.
PrioritySet priority for this data source to be crawled. Select the priority level from the list values: - Highest - High - Normal - Low - Lowest
Document TypeSpecify a value which can be used to restrict queries when utilizing the Netwrix Data Classification search index.

Exchange Server

The Exchange Server source configuration screen allows you to enable the crawling and classification of multiple Exchange mailboxes from the same Exchange server.

IMPORTANT! Automatic detection, crawling and classification of multiple Exchange mailboxes from the same Exchange server (and, respectively, Exchange Server content source configuration) is only supported for Exchange Server 2013 or later due to limitations in the Microsoft APIs. For earlier versions, consider using Exchange Mailbox content source.

You can use Match Rules to include and exclude the certain mailboxes.

To configure an Exchange Server source:

  1. In Netwrix Data Classification management console, open the Sources view and click Add.
  2. Select Exchange source type and in the properties window specify the necessary settings.
  3. Remember to click the "wrench" icon next to Settings in the bottom-left corner to display all settings.

Authentication type: Modern authentication

NOTE: For Email Address / Password, the Administrator account that has been assigned the right of the Discovery Management role and be given the Mailbox Search and MailboxSearchApplication permissions.

If you plan to use this authentication type, specify the following:

OptionDescription
Authentication typeSelect Modern (Exchange Online)
Admin UsernameSpecify the administrative account for the required Exchange Online organization.
Tenant IDEnter the Tenant ID you obtained at Step 5: Obtain Tenant ID.
Certificate thumbprintEnter the certificate thumbprint you prepared at Step 4: Configure Certificates & secrets.
Application IDEnter the app ID you got at application registration at Step 2: Create and Register a new app in Azure AD (it can be found in the Azure AD app properties >Overview).

exchangeonline_cfg_modern_auth_thumb_0_0

Authentication type: Basic

If you plan to use this authentication type, you will need to specify the following:

OptionDescription
Email Address / PasswordAdministrator account that has been assigned the right of Impersonation as well as the Discovery Management role. See Configure Microsoft Exchange for Crawling and Classification for details on the rights assignment.

Other configuration settings

The following settings are also required in both cases:

OptionDescription
Exchange API URLBy default, the crawling engine will attempt to locate the necessary URL of Exchange Web Services API by using the Exchange AutoDiscover functionality. So, typically, you can leave this field blank. If, however, the Exchange AutoDiscover is not available, then you should specify the Exchange API URL explicitly as follows: https://<servername>/EWS/Exchange.asmx.
Crawl RangeDefine what portions of data should be retrieved from the Exchange server: - Select Date Range to crawl a static set of data within the required interval. - Select Since if you want to periodically re-crawl content from the specified date, taking into account the last crawl date for each artifact.
Match RulesDefine which mailboxes will be crawled as part of an Exchange Server source. Examples: 1. .*@netwrix.com— enter the wildcard (*) and the domain (here netwrix.com) to restrict crawling to a set of domain mailboxes 2. .*—enter if you want all mailboxes to be crawled
Detection PeriodSpecify how often the source should be checked for changes. Default period is 1 day.

Having specified all the necessary settings, click the Save button.