Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to content
View sofiawu's full-sized avatar

Block or report sofiawu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Document Artifical Intelligence

119 4 Updated Oct 11, 2024
Python 171 18 Updated Oct 10, 2024

主要记录大语言大模型(LLMs) 算法(应用)工程师相关的知识及面试题

HTML 3,029 358 Updated Aug 19, 2024

A comprehensive collection of IQA papers

TeX 950 65 Updated Oct 9, 2024
12 1 Updated Jun 19, 2024

ACM Multimedia 2023: DocDiff: Document Enhancement via Residual Diffusion Models. Also contains 1597 red seals in Chinese scenes, along with their corresponding binary masks.

Python 224 23 Updated Aug 22, 2024

A comprehensive list of awesome document image rectification papers.

356 30 Updated Apr 2, 2024

Efficient, scalable and enterprise-grade CPU/GPU inference server for 🤗 Hugging Face transformer models 🚀

Python 1,646 150 Updated Oct 25, 2023

Implementation of handwriting generation with use of recurrent neural networks in tensorflow. Based on Alex Graves paper (https://arxiv.org/abs/1308.0850).

Python 533 107 Updated Jan 14, 2018

A synthetic data generator for text recognition

Python 3,244 968 Updated Jul 18, 2024

This repository contains a paper collection of the methods for document image processing, including appearance enhancement, deshadow, dewarping, deblur, and binarization.

156 10 Updated Sep 13, 2024

This repository contains demos I made with the Transformers library by HuggingFace.

Jupyter Notebook 9,210 1,417 Updated Aug 8, 2024

Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes

Python 341 44 Updated Sep 17, 2024

Generates CGI sample receipts for use in receipt scanning CV automated tests

Python 84 8 Updated Nov 22, 2022

Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022

Python 5,755 467 Updated Jul 11, 2024

OCR & Ground Truth Resources

75 11 Updated May 3, 2022

Code for AAAI 2023 Paper : “Alignment-Enriched Tuning for Patch-Level Pre-trained Document Image Models”

Python 17 3 Updated Dec 6, 2022

Pre-Training with Whole Word Masking for Chinese BERT(中文BERT-wwm系列模型)

Python 9,606 1,386 Updated Jul 31, 2023

The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.

Python 406 44 Updated Aug 15, 2024

The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.

Python 348 49 Updated Jul 21, 2024

Synthesize distorted document image and control points.

Python 41 11 Updated Sep 14, 2022

This is a pytorch implementation of DocUNet: Document Image Unwarping via A Stacked U-Net

Jupyter Notebook 110 32 Updated Feb 15, 2019

Code for the paper "DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks" (ICCV '19)

Python 497 97 Updated Jul 6, 2023

Document Image Binarization

Python 72 14 Updated Aug 27, 2024

Save ranges from Excel documents as images

Python 96 25 Updated Dec 9, 2020

A selectional auto-encoder approach for document image binarization

Python 101 23 Updated Dec 8, 2022

Unofficial implementation of DocMAE (WIP): Document Image Rectification via Self-supervised Representation Learning

Python 14 3 Updated Dec 20, 2023

Table Transformer (TATR) is a deep learning model for extracting tables from unstructured documents (PDFs and images). This is also the official repository for the PubTables-1M dataset and GriTS ev…

Python 2,237 250 Updated Jun 24, 2024

DocEnTr: An end-to-end document image enhancement transformer - ICPR 2022

Jupyter Notebook 138 33 Updated Sep 2, 2024

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

Python 11,090 1,087 Updated May 11, 2024
Next