AI Tools to Analyze Complex Documents

Generative AI is useful for condensing and simplifying huge quantities of information. Use circumstances for entrepreneurs and enterprise execs embrace:

  • Explaining sophisticated paperwork, resembling patents.
  • Summarizing lengthy narratives into key factors and definitions.
  • Extracting vital sections of ebooks and whitepapers for blogs and promotional supplies.

What follows are 4 gen AI instruments to perform these duties. To check, I instructed the instruments to research a 2023 patent from Google for Search Generative Expertise, with three prompts:

  • What is that this doc about?
  • How does SGE choose exterior assets in its solutions? Do the assets originate from natural search outcomes?
  • Outline important phrases from the doc.

My objectives had been to guage the accuracy of the responses and verify for “hallucinating” — inventing information.



ChatGPT can now course of uploaded paperwork, together with PDF recordsdata. It supplied a generic overview of the immediate, “What is that this doc about?”

However it was the one instrument that answered the second query, “Did the assets originate from natural search outcomes?”

In response to the patent doc, not all paperwork are chosen solely from natural search outcomes. The choice course of entails completely different classes of search outcome paperwork:

Question-Responsive Search Outcome Paperwork: These are paperwork obtained in response to the unique question submitted by the person.

Associated-Question-Responsive Search Outcome Paperwork: These paperwork are derived from searches carried out for queries associated to the unique question.

Current-Search-Responsive Search Outcome Paperwork: These are paperwork that responded to latest queries, presumably associated to the present question or person’s search historical past.

ChatGPT made up the names of the classes (in daring)! The patent doesn’t use these phrases.

So I prompted, “I don’t see these class names within the authentic doc. Did you make them up?”

ChatGPT’s response:

You’re appropriate, and I apologize for the confusion. The class names I used (“Question-Responsive Search Outcome Paperwork,” “Associated-Question-Responsive Search Outcome Paperwork,” and “Current-Search-Responsive Search Outcome Paperwork”) had been certainly my very own phrasing to summarize the ideas described within the patent. They don’t seem to be immediately quoted from the doc.

The patent really discusses the method of choosing paperwork based mostly on their responsiveness to a given question, associated queries, and up to date searches. These alternatives can embrace paperwork that responded to the unique question, in addition to these responding to associated or latest queries. My intent was to make clear this course of, however I ought to have said clearly that these class names had been my very own summarizations, not direct quotations from the doc.

Thanks for mentioning this discrepancy.

ChatGPT’s response is an instance of hallucinating, a weak spot of many gen-AI instruments.


Home page of Claude


Claude is an impartial generative AI mannequin from Anthropic that doesn’t use ChatGPT, not like the opposite instruments on this record. In my testing, Claude was essentially the most cautious in responding.

For instance, the opposite three instruments answered “Sure” to “Are the paperwork chosen from the natural search outcomes?” Solely Claude provided a nuanced response:

The patent doesn’t explicitly state if the paperwork are chosen from natural search outcomes.

Nonetheless, there are a number of contextual clues that counsel they’re referring to natural search outcomes:

    • The background describes offering query-based summaries of “search outcome paperwork” with out specifying if these are natural or paid outcomes. Sometimes “search outcomes” refers to natural outcomes returned by a search engine.
    • There isn’t a point out of filtering out or dealing with paid/promoted outcome paperwork otherwise than natural paperwork when choosing paperwork to summarize.
    • The purpose appears to be summarizing essentially the most related paperwork to the search question to supply an outline for customers. Natural search outcomes are usually extra optimized for relevancy than promotional or paid outcomes.
    • Associated queries are used to broaden doc choice. Associated/expanded queries are usually used with natural doc units, not paid promoting outcomes.

So whereas not definitively said, the patent language strongly suggests the query-responsive paperwork being chosen to generate summaries are natural search outcomes quite than paid or promotional outcomes.

PDF Buddies

Home page of PDF Pals

PDF Buddies

PDF Buddies is a desktop software that provides 30 free questions for testing. It requires a ChatGPT API key for these exams.

As soon as put in, PDF Buddies can course of uploaded PDF paperwork.

The instrument helpfully contains web page numbers with its responses, making it simpler to confirm the information. Clicking any web page quantity will take you to that part within the doc.

In my testing, PDF Buddies didn’t simplify the patent to my stage of understanding. Its responses had been too technical, regardless of my prompting it in any other case. Nonetheless, the summaries had been helpful, albeit sophisticated.


Home page of AskYourPDF


AskYourPDF is an online app requiring no API key for testing. After scanning a doc, AskYourPDF suggests non-compulsory follow-up questions. Like PDF Buddies, it contains web page numbers, though they don’t seem to be clickable.

AskYourPDF’s responses had been simpler to know than PDF Buddies’ and, conversely, much less complete. And it didn’t extract definitions from the PDF patent, stating incorrectly that none had been there.

Thus AskYourPDF in my testing was useful for higher-level overviews however not detailed. A good thing about that method, nonetheless, is probably going fewer hallucinations.

Curiously, all 4 instruments analyzed the Google PDF patent barely otherwise. Every supplied distinctive explanations. The secret’s verifying the information. All the instruments made errors.

Latest news
Related news


Please enter your comment!
Please enter your name here