Introduction

Welcome to the Docu-analyzer API.

The Docu-analyzer API is a powerful AI-based tool that extracts text, layout, tables, and key-value pairs from various unstructured document formats including HWP, Docx, and PDF.

The extracted data can be immediately utilized in various workflows such as RAG (Retrieval-Augmented Generation) data preprocessing, RPA (Robotic Process Automation), and AI dataset construction.

Synap DocuAnalyzer


Key Features

Maximize data utilization with diverse document support and accurate document structure analysis
Support for Various Business Document Formats

Support for Various Business Document Formats

Supports a wide range of document formats including Hangul / MS-Office / PDF / Images, enabling compatibility with most document types held by enterprises

Perfect Analysis of Hidden Document Structure

Perfect Analysis of Hidden Document Structure

  • Recognition of detailed document structure information including headings, paragraphs, headers, footers, page numbers, captions, lists, etc.
  • Recognition of visual information such as tables and images
Highly Practical Output Formats

Highly Practical Output Formats

  • Markdown output support for Large Language Model (LLM) construction
  • XML output support for enterprise database construction

Main Features

Powerful features for document structure analysis, from table recognition to document reading order support
Main Feature 1 Main Feature 2 Main Feature 3 Main Feature 4

Use Cases

LLM Model Training
LLM Model Training

Understanding various types of documents and
building LLM models based on owned documents

Conversational AI (RAG)
Conversational AI (RAG)

Improving natural language processing capabilities through understanding documents including tables and images
and developing conversational AI Q&A systems

Business Automation (RPA)
Business Automation (RPA)

Extracting only necessary information to
build business automation systems

Digital Archive
Digital Archive

Structuring unstructured documents and
building large-scale document digital archives


Next Steps

  • Quickstart: Quickly request your first document analysis and check the results
  • Authentication: Learn how to obtain API keys and authenticate
  • Supported Formats: Check the complete list of analyzable file formats