forked from PaddlePaddle/PaddleOCR
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* updata en docs * updata en docs * updata en docs * updata en docs
- Loading branch information
Showing
8 changed files
with
274 additions
and
8 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,136 @@ | ||
## 1. Introduction to All-in-One Development | ||
|
||
The All-in-One development tool [PaddleX](https://github.com/PaddlePaddle/PaddleX/tree/release/3.0-beta1), based on the advanced technology of PaddleOCR, supports **low-code full-process** development capabilities in the OCR field. Through low-code development, simple and efficient model use, combination, and customization can be achieved. This will significantly **reduce the time consumption** of model development, **lower its development difficulty**, and greatly accelerate the application and promotion speed of models in the industry. Features include: | ||
|
||
* 🎨 [**Rich Model One-Click Call**](https://paddlepaddle.github.io/PaddleOCR/latest/en/paddlex/quick_start.html): Integrates **17 models** related to text image intelligent analysis, general OCR, general layout parsing, table recognition, formula recognition, and seal recognition into 6 pipelines, which can be quickly experienced through a simple **Python API one-click call**. In addition, the same set of APIs also supports a total of **200+ models** in image classification, object detection, image segmentation, and time series forcasting, forming 20+ single-function modules, making it convenient for developers to use **model combinations**. | ||
|
||
* 🚀 [**High Efficiency and Low barrier of entry**](https://paddlepaddle.github.io/PaddleOCR/latest/en/paddlex/overview.html): Provides two methods based on **unified commands** and **GUI** to achieve simple and efficient use, combination, and customization of models. Supports multiple deployment methods such as **high-performance inference, service-oriented deployment, and edge deployment**. Additionally, for various mainstream hardware such as **NVIDIA GPU, Kunlunxin XPU, Ascend NPU, Cambricon MLU, and Haiguang DCU**, models can be developed with **seamless switching**. | ||
|
||
> **Note**: PaddleX is committed to achieving pipeline-level model training, inference, and deployment. A model pipeline refers to a series of predefined development processes for specific AI tasks, including combinations of single models (single-function modules) that can independently complete a type of task. | ||
## 2. OCR-Related Capability Support | ||
|
||
In PaddleX, all 6 OCR-related pipelines support **local inference**, and some pipelines support **online experience**. You can quickly experience the pre-trained model effects of each pipeline. If you are satisfied with the pre-trained model effects of a pipeline, you can directly proceed with [high-performance inference](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_deploy/high_performance_deploy_en.md)/[service-oriented deployment](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_deploy/service_deploy_en.md)/[edge deployment](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_deploy/lite_deploy_en.md). If not satisfied, you can also use the **custom development** capabilities of the pipeline to improve the effects. For the complete pipeline development process, please refer to [PaddleX Pipeline Usage Overview](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_usage/pipeline_develop_guide_en.md) or the tutorials for each pipeline. | ||
|
||
In addition, PaddleX provides developers with a full-process efficient model training and deployment tool based on a [cloud-based GUI](https://aistudio.baidu.com/pipeline/mine). Developers **do not need code development**, just need to prepare a dataset that meets the pipeline requirements to **quickly start model training**. For details, please refer to the tutorial ["Developing Industrial-level AI Models with Zero Barrier"](https://aistudio.baidu.com/practical/introduce/546656605663301). | ||
|
||
<table> | ||
<tr> | ||
<th>Pipeline</th> | ||
<th>Online Experience</th> | ||
<th>Local Inference</th> | ||
<th>High-Performance Inference</th> | ||
<th>Service-Oriented Deployment</th> | ||
<th>Edge Deployment</th> | ||
<th>Custom Development</th> | ||
<th><a href="https://aistudio.baidu.com/pipeline/mine">No-Code Development On AI Studio</a></td> | ||
</tr> | ||
<tr> | ||
<tr> | ||
<td>OCR</td> | ||
<td><a href="https://aistudio.baidu.com/community/app/91660/webUI?source=appMineRecent">Link</a></td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
</tr> | ||
<tr> | ||
<td>PP-ChatOCRv3</td> | ||
<td><a href="https://aistudio.baidu.com/community/app/182491/webUI?source=appCenter">Link</a></td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
</tr> | ||
<tr> | ||
<td>Table Recognition</td> | ||
<td><a href="https://aistudio.baidu.com/community/app/91661?source=appMineRecent">Link</a></td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
</tr> | ||
<tr> | ||
<td>Layout Parsing</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
</tr> | ||
<tr> | ||
<td>Formula Recognition</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
</tr> | ||
<tr> | ||
<td>Seal Recognition</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
</tr> | ||
</table> | ||
|
||
|
||
> ❗Note: The above capabilities are implemented based on GPU/CPU. PaddleX can also perform local inference and custom development on mainstream hardware such as Kunlunxin, Ascend, Cambricon, and Haiguang. The table below details the support status of the pipelines. For specific supported model lists, please refer to the [Model List (Kunlunxin XPU)](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/support_list/model_list_xpu_en.md)/[Model List (Ascend NPU)](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/support_list/model_list_npu_en.md)/[Model List (Cambricon MLU)](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/support_list/model_list_mlu_en.md)/[Model List (Haiguang DCU)](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/support_list/model_list_dcu_en.md). We are continuously adapting more models and promoting the implementation of high-performance and service-oriented deployment on mainstream hardware. | ||
**🚀 Support for Domestic Hardware Capabilities** | ||
|
||
<table> | ||
<tr> | ||
<th>Pipeline Name</th> | ||
<th>Ascend 910B</th> | ||
<th>Kunlunxin XPU</th> | ||
<th>Cambricon MLU</th> | ||
<th>Haiguang DCU</th> | ||
</tr> | ||
<tr> | ||
<td>General OCR</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
</tr> | ||
<tr> | ||
<td>Table Recognition</td> | ||
<td>✅</td> | ||
<td>🚧</td> | ||
<td>🚧</td> | ||
<td>🚧</td> | ||
</tr> | ||
</table> | ||
|
||
## 3. List and Tutorials of OCR-Related Model Pipelines | ||
|
||
- **OCR Pipeline**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_usage/tutorials/ocr_pipelines/OCR_en.md) | ||
- **Table Recognition Pipeline**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_usage/tutorials/ocr_pipelines/table_recognition_en.md) | ||
- **PP-ChatOCRv3-doc Pipeline**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_usage/tutorials/information_extration_pipelines/document_scene_information_extraction_en.md) | ||
- **Layout Parsing Pipeline**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_usage/tutorials/ocr_pipelines/layout_parsing_en.md) | ||
- **Formula Recognition Pipeline**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_usage/tutorials/ocr_pipelines/formula_recognition_en.md) | ||
- **Seal Recognition Pipeline**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/pipeline_usage/tutorials/ocr_pipelines/seal_recognition_en.md) | ||
|
||
## 4. List and Tutorials of OCR-Related Modules | ||
|
||
- **Text Detection Module**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/text_detection_en.md) | ||
- **Seal Detection Module**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/seal_text_detection_en.md) | ||
- **Text Recognition Module**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/text_recognition_en.md) | ||
- **Formula Recognition Module**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/formula_recognition_en.md) | ||
- **Table Structure Recognition Module**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/table_structure_recognition_en.md) | ||
- **Text Image Unwarping Module**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/text_image_unwarping_en.md) | ||
- **Layout Detection Module**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/layout_detection_en.md) | ||
- **Document Image Orientation Classification Module**: [Tutorial](https://github.com/PaddlePaddle/PaddleX/blob/release/3.0-beta1/docs/module_usage/tutorials/ocr_modules/doc_img_orientation_classification_en.md) |
Oops, something went wrong.