Managing Multiple eDiscovery Cases in Purview: Where Things Go Wrong
Learn effective strategies for managing multiple eDiscovery cases in Microsoft 365 Purview, and discover how to overcome common bottlenecks and...
In Microsoft Purview eDiscovery, indexing is the process that allows search tools to read and understand the content ...
In Microsoft Purview eDiscovery, indexing is the process that allows search tools to read and understand the content inside files, emails, and attachments. When data is fully indexed, every searchable element, such as text inside documents or attachments, can be discovered through search queries. However, when an item becomes partially indexed, only part of its content is searchable. The rest of the data remains hidden from standard searches.
Partial indexing occurs when Microsoft Purview cannot completely process a file due to technical limitations related to size, complexity, or time constraints. In these situations, the system indexes only the portion of the file it was able to process before reaching those limits.
From an eDiscovery perspective, this creates a major problem. Legal teams rely on search tools to locate relevant evidence quickly and accurately. If a file is only partially indexed, critical information may exist inside the document but remain undiscoverable through normal search queries.
A simple analogy helps illustrate the issue. Imagine searching for a word inside a book where only the first half of the pages were scanned into a digital search system. If the word appears on a page that was never scanned, the search tool will report that the word does not exist in the book, even though it actually does.
The same situation occurs with partially indexed data in Microsoft Purview.
Attachment limits, parsing timeouts, and Excel size caps in Microsoft Purview are key technical restrictions that cause files to become partially indexed. Understanding these limits helps explain why some documents cannot be fully searched during investigations.
Partial indexing is not random. It occurs because Microsoft Purview applies several technical limits when attempting to read and index files. These limits are designed to maintain system performance, but can directly affect the completeness of search results.
One major limitation involves email attachment size. Attachments that exceed 150 MB are not fully parsed for indexing. When this occurs, the email is flagged as partially indexed because the content inside the large attachment cannot be fully examined.
Parsing time limits: Microsoft Purview allows only 30 seconds to analyze and index a single item. If a file takes longer, due to complexity, embedded objects, or nested attachments, the process stops early, and the item becomes partially indexed.
Attachment quantity limits: Only the first 250 attachments within an email are indexed.
Attachment depth limits: Microsoft Purview indexes up to 30 nested attachments. Emails exceeding these thresholds will not have every attachment processed.
Microsoft Purview also places restrictions on spreadsheet indexing. Excel files larger than 4 MB cannot be fully indexed, which means significant portions of data stored in spreadsheets may not be searchable.
Finally, the indexing engine can only extract two million characters from a document. Any content beyond that threshold is ignored during indexing. Each of these limitations increases the likelihood that some files will be only partially indexed during an investigation.
Partial indexing in Microsoft Purview affects search accuracy because the platform cannot search the full contents of those documents. This directly affects the accuracy and reliability of eDiscovery searches.
Search accuracy is the foundation of any eDiscovery workflow. Legal teams depend on reliable search results to identify relevant communications, documents, and records across massive data sets. When partial indexing occurs, search accuracy becomes compromised. A search query may fail to return relevant items simply because the relevant text was located in the portion of the file that was never indexed.
This means investigators could mistakenly believe that no relevant data exists when, in reality, it does, it just resides in the non-indexed portion of the file. This challenge becomes especially significant when organizations are handling large legal matters, regulatory inquiries, or internal investigations. Missing even a small percentage of relevant records can affect the overall completeness of discovery.
The problem is not that the files are missing from the system. They are still present in the data set. The issue is that the search engine cannot see the full contents of those files.
Because eDiscovery workflows rely heavily on search queries to identify relevant material, partial indexing introduces uncertainty into the process.
Partial indexing affects legal defensibility in eDiscovery because organizations must be able to prove that their search and review process was thorough and reliable. When Microsoft Purview partially indexes files, this requirement becomes harder to meet.
Legal defensibility refers to the ability of an organization to demonstrate that its eDiscovery process was thorough, consistent, and reasonable. Courts and regulators expect organizations to conduct discovery in a way that reliably identifies relevant evidence.
Partial indexing creates several challenges for legal teams and investigators:
For legal teams, the core issue is visibility. If the indexing process cannot read the full contents of files, teams may unknowingly operate with incomplete information.

Partially indexed items require additional handling during the review process. Because their full content was not indexed, they cannot be reliably evaluated through automated search filtering alone.
As a result, organizations often need to export these items separately and review them manually. This increases the time and effort required during the discovery process. Manual review is significantly slower than automated search-driven review. When investigators must examine large numbers of partially indexed files individually, it can extend project timelines and increase legal costs.
The complexity also increases the risk of oversight. The more manual processes involved in discovery, the greater the chance that important documents could be overlooked. For large organizations with massive data volumes, this issue can scale quickly. Even a small percentage of partially indexed items could represent thousands of documents requiring additional attention.
The extra review burden also places pressure on legal teams operating under strict deadlines. When courts impose discovery timelines, delays caused by incomplete indexing can complicate compliance.
Legal teams often do not realize when data is incompletely indexed in Microsoft Purview because partial indexing is not always obvious during normal search workflows.
One of the most significant risks associated with partial indexing is that it often goes unnoticed. In many cases, legal teams assume that search results represent the complete set of relevant documents. Unless investigators specifically examine indexing reports or identify partially indexed items within search results, they may not realize that portions of files were never indexed.
Because the files themselves remain present in the system, there is no obvious indication that some content cannot be searched. This creates a silent risk within the discovery process. Investigators may conduct thorough searches, apply multiple keywords, and carefully review results, yet still miss critical information that exists inside non-indexed portions of documents.
The problem is not necessarily a lack of diligence. Instead, it is a limitation in how the indexing system processes complex or large files.
Without awareness of these indexing limitations, organizations may unknowingly rely on incomplete discovery data.
Expireon addresses partial indexing challenges because Microsoft Purview can partially index large or complex files, leading some organizations to adopt complementary platforms designed to eliminate these indexing limitations.
Organizations can reduce the risk of partial indexing by using platforms designed to overcome Purview’s technical indexing limits. Cloudficient’s Expireon addresses these constraints by providing more flexible and scalable indexing capabilities.
Key ways Expireon addresses partial indexing include:
By removing these constraints, Expireon enables searches across fully indexed data sets. This reduces the risk that relevant evidence remains hidden inside non-indexed portions of files and helps legal teams work with more complete discovery results.
Partial indexing is a documented limitation within Microsoft Purview eDiscovery that can affect the completeness of search results and discovery workflows.
Microsoft Purview provides powerful eDiscovery capabilities for organizations operating within the Microsoft 365 ecosystem. However, its indexing limitations can introduce hidden risks into the discovery process.
Partial indexing occurs when files exceed technical thresholds related to size, complexity, or processing time. When this happens, only part of a document becomes searchable, leaving the remaining content invisible to standard search queries.
These limitations can affect search accuracy, increase review workloads, and introduce potential legal defensibility challenges. Because partially indexed items often require separate review processes, organizations must take additional steps to ensure that important data is not overlooked.
Understanding how partial indexing works is essential for legal and compliance teams responsible for managing eDiscovery workflows. Awareness of these limitations allows organizations to better plan discovery strategies and avoid relying solely on incomplete search results.
What happens to partially indexed items during a Purview search?
Partially indexed items are still included in search result estimates, but the system cannot search all of their contents, which may require separate review of those items.
Why does Microsoft Purview limit indexing time for files?
Purview limits parsing to about 30 seconds per item to maintain system performance and prevent large or complex files from slowing down the indexing process.
Can large files still appear in search results if they are partially indexed?
Yes. Large files may appear in results, but only the portion that was successfully indexed is searchable, which means relevant information inside the file may still be missed.
Why do organizations sometimes export partially indexed files separately?
Because Purview cannot fully index those items, teams often export them separately so investigators can manually review the content that the search engine could not process.
How do attachment structures affect indexing in Purview?
Emails with large numbers of attachments or deeply nested attachments may not be fully indexed because Purview only indexes the first 250 attachments and up to 30 nested levels.
Learn effective strategies for managing multiple eDiscovery cases in Microsoft 365 Purview, and discover how to overcome common bottlenecks and...
Understand the hidden costs of Microsoft 365 Purview eDiscovery licensing and ensure your legal team has the necessary tools for complex...
Learn about the limits of Microsoft Purview for legal holds. Find out how CaseFusion can improve governance, automation, and reporting.