WeChat OCR API Docker

A Dockerized REST API service for text recognition using WeChat's OCR engine.

Overview

This project wraps the WeChat OCR functionality from the excellent wechat-ocr project into a simple REST API service that can be easily deployed using Docker. It allows you to perform optical character recognition on images by leveraging WeChat's powerful OCR capabilities.

⚠️ Warning

This is an open-source project intended for learning and communication purposes only. Please do not use it for commercial activities. Users are solely responsible for any consequences resulting from improper use of this project.

Copyright Disclaimer: This project is merely a containerization of an existing open-source project. If you believe this repository infringes upon your copyright or intellectual property rights, please contact the repository owner immediately, and the repository will be promptly removed. We respect intellectual property rights and have no intention of infringing on them.

Acknowledgements

This project would not be possible without the work of swigger and their wechat-ocr project. Their efforts in reverse-engineering and creating a usable interface for WeChat's OCR functionality form the foundation of this service.

Quick Start

Using Docker

# Pull the image
docker pull golangboyme/wxocr

# Run the container
docker run -d -p 5000:5000 --name wechat-ocr-api golangboyme/wxocr

API Usage

Send a POST request to /ocr with a JSON payload containing your base64-encoded image:

curl -X POST http://localhost:5000/ocr \
  -H "Content-Type: application/json" \
  -d '{"image": "BASE64_ENCODED_IMAGE_DATA"}'

Example Response

{
  "errcode": 0,
  "height": 72,
  "width": 410,
  "imgpath": "temp/5726fe7b-25d6-43a6-a50d-35b5f668fbb6.png",
  "ocr_response": [
    {
      "text": "aacss",
      "left": 80.63632202148438,
      "top": 29.634929656982422,
      "right": 236.47093200683594,
      "bottom": 55.28932189941406,
      "rate": 0.9997046589851379
    },
    {
      "text": "xxzsa",
      "left": 312.625,
      "top": 30.75,
      "right": 395.265625,
      "bottom": 55.09375,
      "rate": 0.997739315032959
    }
  ]
}

Python Client Example

Here's a simple Python client to use the OCR API:

import requests
import base64
import os

def ocr_recognize(image_path=None, image_url=None, api_url="http://localhost:5000/ocr"):
    """
    Send an image to the OCR API service and get the recognition results.
    Use either image_path or image_url (one is required).
    """
    # Get image data
    if image_path:
        if not os.path.exists(image_path):
            print(f"Error: Local image not found: {image_path}")
            return
        with open(image_path, "rb") as image_file:
            img_data = image_file.read()
    elif image_url:
        try:
            response = requests.get(image_url)
            response.raise_for_status()
            img_data = response.content
        except Exception as e:
            print(f"Failed to download image: {str(e)}")
            return
    else:
        print("Please provide either image_path or image_url")
        return

    # Convert image to base64
    base64_image = base64.b64encode(img_data).decode('utf-8')
    
    # Send request to API
    try:
        response = requests.post(api_url, json={"image": base64_image})
        response.raise_for_status()
        return response.json()
    except Exception as e:
        print(f"API request failed: {str(e)}")
        return None

# Example usage
if __name__ == "__main__":
    # Local image example
    result = ocr_recognize(image_path="ocrtest.png")
    if result:
        print(result)
    
    # URL image example (uncomment to use)
    # result = ocr_recognize(image_url="https://example.com/image.png")

Project Structure

main.py: The Flask API service that handles OCR requests
opt/wechat/wxocr: WeChat OCR binary
opt/wechat/: WeChat runtime dependencies

Technical Details

This service uses a Flask application to provide a REST API interface to the WeChat OCR functionality. When an image is submitted:

The base64-encoded image is decoded
A temporary file is created
The image is processed by the WeChat OCR engine via the wcocr Python binding
Results are returned in JSON format
Temporary files are cleaned up

Limitations

Currently only supports PNG images (can be extended if needed)
Depends on WeChat's OCR binaries which may be updated by WeChat

License

This project is provided under MIT License with the following additional terms:

The repository owner disclaims all liability for any use of this software.
By using, copying, modifying, or distributing this software, you acknowledge that any actions taken with this software are solely your own responsibility.
Any use of this repository and its contents is done at the user's own risk and discretion.
The repository owner is not responsible for any consequences, legal or otherwise, resulting from the use of this software.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
images		images
templates		templates
wx/opt/wechat		wx/opt/wechat
.gitignore		.gitignore
.python-version		.python-version
Dockerfile		Dockerfile
README.MD		README.MD
docker-compose.yml		docker-compose.yml
main.py		main.py
pyproject.toml		pyproject.toml
test.sh		test.sh
uv.lock		uv.lock
wcocr.cpython-312-x86_64-linux-gnu.so		wcocr.cpython-312-x86_64-linux-gnu.so

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WeChat OCR API Docker

Overview

⚠️ Warning

Acknowledgements

Quick Start

Using Docker

API Usage

Example Response

Python Client Example

Project Structure

Technical Details

Limitations

License

Contributing

About

Releases

Packages

Languages

laoertongxue/wxocr

Folders and files

Latest commit

History

Repository files navigation

WeChat OCR API Docker

Overview

⚠️ Warning

Acknowledgements

Quick Start

Using Docker

API Usage

Example Response

Python Client Example

Project Structure

Technical Details

Limitations

License

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages