1.2 Searching in Retain’s Archives

Path: Retain Server Manager > Overview > Search Messages

Figure 1-1 The Search Messages page

If you log in as a user with administrative privileges, you see the six tabs documented in the sections that follow. If you log in as a user without administrative privileges, the Search tab opens and you can search on only the mailboxes where you have access as a regular user.

1.2.1 The Browse Tab

The Browse tab immediately grants access to the selected mailbox. Individual users will only see their own mailbox, while users with the administrator right to search all mailboxes will have the option to change mailboxes to another user.

Path: Retain Server Manager > Overview > Search Messages > Browse tab

Retain is an archiving solution not a message management system. All items are stored in the location Retain found them in the production system and cannot be moved.

To change to a different mailbox, select the ‘Change Mailbox’ button in archive toolbar. Once clicked, the ‘Select Mailbox’ dialog will open.

The Select Mailbox dialog asks for specific information. Select which mailbox you want to see by selecting its radio button and clicking OK.

When searching for a mailbox, the system of the mailbox must be selected as mailboxes from different systems may have the same user name or criteria. Select which mail system the desired user belongs to, specify any further criteria, or leave the criteria blank to display all possible mailboxes from that system.

(If Retain for Social Messaging is set to anonymous user, all Social Messaging data will be contained under the single user ‘?@?’, and separate user names and pertinent information is contained in the ‘from’ dialog. Otherwise, individual user accounts will be displayed.)

If the search results are extensive, the system will have a ‘Next’ or ‘Previous’ button at the bottom of the search results window, which displays the next set of results.

Refine search parameters to reach a manageable search result.

In Retain, you can only browse one mailbox at a time. To search multiple mailboxes simultaneously, use the search function.

After selecting a mailbox, click ‘Ok’ to load that mailbox into the viewer.

How the browse interface appears will depend on the email system being archived.

Exchange

Exchange

GroupWise

Google Apps

Blackberry

CellTrust

Mobile

1.2.2 Using the Search Tab

Retain’s Search feature designed to work much like a Google search, with anticipated auto-correct search terms. The Search tab allows searching across the entire Archive for administrators, while users can only search their own archives.

Path: Retain Server Manager > Overview > Search Messages > Search tab

The Search tab allows for management of searches. This is accomplished by Saving searches for later use. These searches can then be shared with others, loaded later, removed when no longer desired, and trimmed to remove duplicate data.

The icons for these functions are on the left, next to the search field. In order: Load, Save, Share, Delete, and Collapse Duplicates. Collapse Duplicates removes all duplicate results in GroupWise mailboxes.

To begin a search, type the desired search term into the search query field, or load a saved search. The search results will be automatically populated on specified search terms after a sufficient pause in typing. The auto complete suggestion feature requires at least 3 characters to be specified and a pause of several seconds before it will begin to work.

Tokenized Search Phrase

The indexing engine follows Unicode Standard Annex #29. This standard uses many common characters such as ' " . : @ + - * / and , as phrase-ending or-beginning characters. These characters will cause the system to read any character separated terms as individually entered items when the search is performed. Individually entered terms are treated as OR searches. For example, this means that using the search term 10/20 will be processed as 10 OR 20. Substituting a space instead of the character will provide a logical 'AND' search. For example, searching for 10 20 will be processed as 10 AND 20 as the search term. For example, searching for an email address test@example.com.com will drop the @ even in an exact search with double quotes and with return test AND example.com.

Spaces between normal search terms are a delimiter between search terms:

The search algorithm follows these criteria:

  1. Highest weighting: All delimited words together consecutively. (e.g. Searching for "quick brown fox" while the message contains: "The quick brown fox jumped over the lazy dog.")

  2. All delimited words in the string although they may not be right next to each other. (e.g. quick AND brown AND fox)

  3. Least weighting: Any of the delimited words in the string. (e.g. quick OR brown OR fox)

Results from these different terms are weighted in the order listed above. This behavior can be overridden by locating the solarcloud.indexing.properties file and changing the property phraseSearch.singleWordMatch. Setting this property to '0' turns this off, while leaving it set at '1' activates this search behavior.

Result terms are also highlighted in the user interface to ease visual confirmation that the term exists in the message. However, terms in attachments are not highlighted.

Wildcard Search

Accepted wildcard characters are: *, ?, and "". The ‘*’ denotes ‘any character or characters’, while the ‘?’ denotes ‘any one unknown character’. The double quotes(“”) denote an exact phrase, and only that exact phrase. Retain does not index letter case, so “IT” is the same as “it”.

For example, a search for the term “We all love spoons” will fetch a result of that exact phrase. Asterisks may be added inside the double quotes to allow for incomplete or unknown words: “*e all love sp*” will return both ‘We all love spoons for our ice cream’, and ‘We don’t like sporks but we all love space.’. While the ‘?’ works like the asterisk, it is only for a single unknown character. This is particularly helpful when searching for exact phrases where the terms may be misspelled. For example, “Their going” would miss a misspelled ‘there going’. However, if the search term were “The?? going”, it would catch both.

Supported Regex characters are listed in the Advanced Search section.

Select a final query, either by hitting enter, clicking with the mouse, or using arrows and hitting enter.

Search results will display the results with the search term highlighted for each message. The message type, Sender, Subject, Recipients, date, mailbox, and folder are all displayed.

Search In

Once search results have begun to populate, the left hand scope pane is populated with limiting and filtering options. If faceting is enabled, the side bar will show numbers next to each section, indicating how many hits there are for each particular topic. The hits are total numbers of matching instances, not items. So if a message states a search term several times in the message body, it will be counted as that many hits even though it is only one message.

The Search In criteria limits the area in the message or data where the search is performed:

  • The Subject indicates hits in the Subject field.

  • The Sender field contains the sender of the message or data item.

  • The Recipient is the recipient of the item or message.

  • The Message Content will search in the following locations within a message:

    • body

    • attachment

    • subject

    • headers

Searching exclusively for the domain will be effective with search terms if a complete domain is provided, otherwise the term is recognized as text. If the top level domain is not known, (.com, .org, .edu, .etc) then the search term should use an asterisk afterwards. For example, searching for example.com will yield good results, as will searching for example.* or example*, though results will vary.

Searching for an item that has BCCed recipients has some special behavior. Retain acts like a normal email system, only the sender and the BCCed recipient can see themselves. If you are logged in as the sender, you will be able to see To, CC and BCC recipients as normal. If you are logged in as the To, or CC recipients, you will only see the Sender, To and CCed recipients. If you are logged in as a BCC recipient, you will see the Sender, To, CC, and no other BCC recipients. If you are logged in as admin and in a mailbox other then the Sender, you will not see the BCC recipients listed in the item even though they are returned from the correct mailboxes, except from the Sender folder, because admin is not one of the BCC recipients.

Item Type

The Item Type criteria option limits the type of message which is to be searched. All item types are available in the scope term. Again, the number of available hits is displayed to the side.

The Message Item Type includes:

  • Mail

  • Phone Message

  • Appointment

  • Task

  • Note

  • Message

  • Phone Call

  • BB Pin

Item Source

The Item Source criteria option limits the results to a particular source. This source can be limited to show ‘received’, ‘personal’, ‘sent’, or ‘draft’ items.

Date Range

The Date Range criteria limits the time frame of item’s creation. Only messages which conform to the date range selected will be displayed in the results field.

The date range may be specified for any of the dates corresponding with an item. The range may also be selected form the drop-down menu, with pre-configured times for last week, month, or year. These time frames are for the past 7 days, the past 30 days, or the past year, not the previous calendar time frame.

Mailboxes

The Mailboxes criteria limit which mailbox or mailboxes the search will pull results from. To add users to the selected mailboxes list, click on the ‘Select’ button to launch the mailbox selection window.

The Select Mailboxes window allows for searching of every mailbox available to the user, the admin user can search all mailboxes. Mailboxes must be searched for by system and specified criteria. The results of the search are displayed below, while the active selected mailboxes can be added to the dialog through the use of the ‘Add Selected’ button along the top. Alternately, if the ‘Add All’ button is clicked, it will add all mailboxes displayed in the search results, the Address Book field. Addresses which have been added to the top field may be removed by selecting the red ‘X’.

Once the desired addresses have been added to the top window, select the ‘OK’ button to load them into the search pane.

The Address option limits the results to a selected address. The addresses available are displayed below, and may be selected Addresses in the window are dictated by what is in the result set. Selecting an address adds it as a filter to the top of the search window, or the user may specify an address manually. Multiple addresses may be added to the search window at a time. To remove an address filter, select the ‘X’ next to the active address, and the result set will be reset.

Tag

The Tag option limits the search to items which have been tagged with a specific tag definition. The tag definitions may be personal or global. Tags must be specified in advance and applied before this option will work. Select the desired tag to limit the result set.

Misc.

The Misc option contains limits for the Litigation hold and the Confidential tags. They have two settings: True or False. A Setting of ‘True’ restricts all results to only items which have the selected tag.

1.2.3 The Advanced Search Tab

The Advanced Search tab contains the ability to specify vast amounts of criteria and combine search terms to exclude and include various searchable items to retrieve specific information. This search works better, the more you know about what you are looking for, as it allows fine tuning of criteria.

Path: Retain Server Manager > Overview > Search Messages > Advanced Search tab

Searches will be restricted to the mailboxes specified in the ‘Search Mailboxes’ window. By default, all mailboxes are set for searching. To limit the search to the mailbox or mailboxes specified, click on the Select button to open the ‘Select Mailboxes’ window. The ‘Select Mailboxes’ window functions exactly the same in advanced search as it does in the standard search.

The system will begin to display results as soon as either the search mailboxes have been specified, or new search criteria has been added. To add new criteria, select the Add button.

The Search Criteria contains the ability to specify where to search, operating criteria - (word ends with, word starts with, field contains words, field contains phrase), and the desired search terms. The list of search items and fields available to be specified in the drop-down list is shown. Each variable on the list is tied to appropriate search operators, (date range allows the specification of a date, Confidential tags have a true/false operator, etc.)

Search Field

Subject: search the message subject field.

Recipient: search the message recipient field.

Attachment name: search the names of the message attachments.

Category: search the item’s category field.

Date: this depends on the type of item. This can be a range. If it is a sent email or instant message, it is the sent date. If it is a received email, then the received date; an appointment, the appointment date.

Sent date: search the message sent date field. If the message is an email this can set by email sender and can be spoofed.

Received date: search the message received date field. If the message is an email, it is set by receiving email server and is very reliable.

Begin date: The earliest date to be searched for calendar items, appointments, tasks and so on.

End date: The latest date to be searched for calendar items, appointments, tasks and so on.

Tag: search tags set within Retain.

Litigation hold: search items that have litigation hold applied by Retain.

Confidential: search items that have confidentiality applied by Retain.

Item Type: search the item type, which may be Mail, Phone message, Appointment, Task, Note, Message, Phone call, and BB PIN.

Item Source: search the item source of Received, Sent, Personal, or Draft.

Sender (email): search sender by email address.

Sender (display): search sender by display name.

Sender Domain: search by the sender domain.

Recipient Domain: search by the recipient domain.

Mail server: search by the mail server of sender and recipient.

Messaging Domain: search by the messaging domain of sender and recipient.

Phone number: search by phone number, if the phone number field exists.

Location: search by location, if the location field exists.

Internet Header: search the term in the Internet header field

Message Content: search only the content (body and attachments)

Attachment size: search by attachment size in bytes.

Opened: search only messages that have been opened.

Read: search only messages that have been read.

Private: search only messages that have been marked private.

Operating Criteria

Each field can be restricted to:

Field Contains Phrase: An exact search of the phrase, the same as enclosing the phrase in quotes in simple search. Basically ANDing each word in the phrase. For example, search for “The quick brown fox” will return only items with the entire phrase ‘The quick brown fox’

Field Contains Words: will search for each word in the phrase. This will be ranked by closest match at the top. Basically ORing each word in the phrase. For example, search for “The quick brown fox” will return only items with the entire phrase ‘The OR quick OR brown OR fox’.

Field Does Not Contain Words: will exclude search results with the words.

Words Start With: will search for the word but with a wildcard at the end. For example, deter will be treated like deter* and return determine, determined, deterred, and so on.

Word Ends With: will search for the word but with a wildcard at the front. For example, “tion” will be treated like *tion and return action, playstation, function, and so on.

Cascading Options

In addition, the interface allows for no limit of search terms, additional terms can be added to the search criteria and connected to the previous search terms. Additional criteria may be logically connected with ‘and’, ‘or’, or ‘new group’. To add a new search term and criteria, select the ‘+’ directly to the right of the existing search criteria.

By default, when a new search term is added, it is automatically ‘AND-ed’ together with the previous search term. This allows you to be able to build complex search terms to fit known data.

When building complex search criteria, it is critical to know what you are looking for. For instance, if an insider trading tip was suspected, and the recipient was known as well as some details about the message and when it must have been sent by, the following search could be compiled:

In this search, any message sent which stated ‘merge’, or ‘merger’ in the subject, and contained a known company secret in the message body, or, discussed the name of an executive involved, would be displayed. In addition, the search would also grab any messages sent to the suspected contact before the merger date. Additional criteria which could be added includes the company’s stock listing or any further details pertaining to the proposed leak.

To begin the search, select the Save button at the bottom of the query window to perform the search. The active criteria is now listed in the left pane, and may be edited or removed. To add criteria, select the Edit button to add to or refine the search criteria.

RegEx and Wildcards

Both the Search and Advanced Search contain limited support for Regular Expression (regex) searches. An explanation of regular expression searches are beyond the scope of this documentation. There are extensive tutorials on the Internet.

To use Regular Expressions, simply put the desired regex string into the criteria field, denoted by a ‘/’ on either side of the regex, for example /red queen/. If the ‘/’ is not used, Search will not recognize it as regex.

Wildcard searches may be done with the ‘*’ and ‘?’ characters. The ‘*’ will match zero or more characters, and the ‘?’ will match exactly one character.

Special Characters

The Search has a list of special characters which cannot be searched for, and will cause erratic results with search criteria. The list of non-supported characters is: @,+,-,|,[],{},(),”,\,#,&,~. All of these characters are viewed as delimiters, and will break up the query. They are not supported and will be replaced by a space.

RegEx Example

You can search for a string of numbers, for example a US telephone number, of the format: (012) 345-6789.

You can enter into the search field /[0-9]{3}/ /[0-9]{3}/ /[0-9]{4}/ and all number strings that match a three digit, three digit, four digit grouping will be returned.

1.2.4 The Exported Items Tab

The Exported Items tab shows the export jobs which are currently running, or have been run in the past, on the system. If an export job has completed, this tab will contain the file and provides a link for download of the completed export job.

Path: Retain Server Manager > Overview > Search Messages > Exported Items tab

NOTE:PST files open in Microsoft Outlook.

Adobe PDF Portfolio files require Adobe Flash for viewing.

Once complete, a notification email is sent, if a notification address was provided, and the PDF or PST .zip file is shown here.

Locate the desired export and select the disk icon to download the export file.

A notification also shows in your Notification Center found under "Welcome, [username]" at the top center of the Retain Mailbox web console.

1.2.5 The Tag Definitions Tab

The Tag Definitions tab allows the creation and removal of Tags, their automatic comment, and name. Tags are an informative note which can be attached to any data item in the search messages interface. There is no limit to how many tags any one item may have applied to it, and there is no limit to how many tags a user may create. In addition, tags are also a searchable item, making this one of the most versatile ways to add long-term identification for items in the data store.

Path: Retain Server Manager > Overview > Search Messages > Tag Definitions tab

Before the tag icon will appear on the in the search interface tool bar, there must be at least one tag defined. To define a tag, enter the tag name and initial comment if desired, then, if the user has permissions to do so, define whether the tag is personal or global. Once saved the tag is available for use.

Global tags are tags that any user with the rights to see global tags will be able to view and apply. Personal tags are limited to the user who created them. Only tags visible to users will be available to be searched for by that user.

Any tags created or subject to manipulation by the user logged-in will be displayed under this tab.

To apply a tag to a message or data item in the search messages interface, simply select the data item or items, and then click the ‘Add / Remove’ tag button in the tool bar.

1.2.6 The Options Tab

Path: Retain Server Manager > Overview > Search Messages > Options tab

The options section here is exactly like the section in the Administration | Users section. These settings here are specific to the currently logged in user. .

Path: Retain Server Manager > Overview > Search Messages > Options tab

The Settings Subtab

Path: Retain Server Manager > Overview > Search Messages > Options tab > Settings subtab

Table 1-1 Using the Settings subtab

Field, Option, or Button

Information and/or Action

User-specific Settings Panel

These settings are specific to each user.

  • User ID

  • User ID of the current user.

  • Description

  • Optional information about the user.

  • Authentication Method

  • User’s authentication method.

  • Primary UID

  • System-assigned string that links the user’s rights.

Inheritable Settings from Group Panel

These settings can be inherited from the specified Configuration Group.

  • Group Membership

  • Groups the user belongs to.

  • Account Can Expire

  • If enabled, the account can expire.

  • Change Internal Password

  • If Allow User to Change Password is set to yes, the user can reset the password in Retain..

  • Forwarded Messages Comment

  • The default comment for forwarding messages.

    You can have the user inherit this setting from the specified Configuration Group, or you can set it to one of the values in the drop-down list.

  • Forwarded Messages Internet Domain

  • Automatically append the specified address to forwarded messages.

    You can have the user inherit this setting from the specified Configuration Group, or you can set it to one of the values in the drop-down list.

  • Date Display Format

  • How to display dates.

    You can have the user inherit this setting from the specified Configuration Group, or you can set it to one of the values in the drop-down list.

  • Time Display Format

  • How to display time.

    You can have the user inherit this setting from the specified Configuration Group, or you can set it to one of the values in the drop-down list.

  • Display Number of Messages Per Page

  • How many items to display per page.

    You can have the user inherit this setting from the specified Configuration Group, or you can set it to one of the values in the drop-down list.

  • Message Age Display

  • Default date filter for searching. Can be changed on the fly.

    You can have the user inherit this setting from the specified Configuration Group, or you can set it to one of the values in the drop-down list.

  • Session Timeout (Minutes)

  • A value between 10 and 480 minutes.

    You can have the user inherit this setting from the specified Configuration Group, or you can set it to one of the values in the drop-down list.

The User Rights Subtab

You’re shown the rights you have within Retain on this screen. Available rights may only be changed by a user administrator within the Administration screen.

The Mailboxes Subtab

In this section, you’re shown the mailboxes you have been given explicit rights to work in. By default, you have rights to only your own mailbox. If an administrator explicitly gives you rights to other mailboxes or you are a member of a group that has rights to other mailboxes explicitly, those mailboxes will be shown here.

The Confidential Exceptions Subtab

When a user marks a message as ‘Confidential’ in the archive, the message becomes invisible to all except administrators who have been given the confidential right, or any user which is specified in the exception list.

The ‘Confidential Exceptions’ tab allows users to add any necessary exceptions to the confidential tagging. Confidential tagging may be applied to protect sensitive. However, sometimes this information may need to be viewed by others and instead of granting that user rights to see confidential items for all users, a user may apply that right to only their items.

A group or individual user may be added to or removed from the list.