The PDF Filter

Parse & Pack uses a sophisticated AI-driven filtering system that goes beyond simple keyword matching. It understands the context of every page in your document.


How it works

When you upload a document, our filter performs a three-stage process:

  1. Extraction: The app parses the text and layout information from your PDF.
  2. Analysis: The AI "reads" each page to understand its purpose (is it a question, a solution, a table of contents, or an introduction?).
  3. Matching: The AI compares the content of each page against your specific instructions to decide if it should be kept, discarded, or categorized.

Intelligent Context

Our filter is designed to handle common educational and technical document structures:

  • Multi-page Questions: If a question spans across two pages, you can instruct the AI to "keep the full question even if it spans pages."
  • Referenced Diagrams: The AI can identify when a question on one page refers to a diagram on another, ensuring you don't lose vital context.
  • Semantic Matching: If you ask for "Trigonometry," the filter will keep pages mentioning SOH-CAH-TOA, Sine waves, or triangles, even if the word "Trigonometry" never appears.

Refining your Filter

If the filter is catching too much or too little, you can refine it in the prompt:

  • "Keep only question pages. Strictly exclude marking schemes."
  • "Only include pages that contain at least one diagram."

Learn how to split these results in How to split PDF files.