Skip to main content Link Menu Expand (external link) Document Search Copy Copied

Scene Text Editing papers

SRNet

  • style retention network (SRNet)
  • Editing Text in the wild. 이걸 기반해서 SwapText, STEFANN이 나옴

  • Three modules
    1. text conversion module : changes the text content of the source image into the target text while keeping the original text style
    2. background inpainting module : erases the original text and fills the text region with appropriate texture
    3. fusion module : combines the information from the two former modules, and generates the edited text images
  • paper : https://arxiv.org/pdf/1908.03047.pdf
  • code : https://github.com/endy-see/SRNet-1

STEFANN (CVPR 2020)

  • character-level text editing in image
  • the unobserved character (target) is generated from an observed character (source) being modified
  • replace the source character with the generated character maintaining both geometric and visual consistency with neighboring characters

  • paper : STEFANN_CVPR_2020_paper.pdf
  • code: https://github.com/prasunroy/stefann

RewriteNet (CVPRW 2022)

STRIVE (ICCV 2021)

MOSTEL (AAAI 2023)

  • MOdifying Scene Text image at strokE Level (MOSTEL)
  • generate stroke guidance maps to explicitly indicate regions to be edited
  • propose a Semisupervised Hybrid Learning to train the network with both labeled synthetic images and unpaired real scene text images

  • paper : https://arxiv.org/pdf/2212.01982.pdf
  • code : https://github.com/qqqyd/MOSTEL

SwapText (CVPR 2020)

  • 하고 싶은 task 에 가장 가까운 연구임. 하지만 코드 공개가 안되어있음..
  • a three-stage framework to transfer texts across scene images
    1. first stage: a novel text swapping network to replace text labels only in the foreground image
    2. second stage: a background completion network to reconstruct background images
    3. third stage: the fusion network generate the word image by using the foreground and background images
  • paper : swaptext.pdf
  • code : not released

Font style transfer (cross-language)

FTransGAN (WACV 2021)

OCR

API

Translation

API

Papago : how to use papago api

Image Inpainting

RePaint (CVPR 2022)

GARnet (ECCV 2022)

  • Scene text removal paper using Gated Attention and RoI Generation method
  • Gated Attention : focus on the text stroke as well as the textures and colors of the surrounding regions to remove text from the input image much more precisely
  • RoI Generation : focus on only the region with text instead of the entire image to train the model more efficiently

  • paper : 4705_ECCV_2022_paper
  • code : https://github.com/naver/garnet