May 29, 2022

EAST: A Text Detection Algorithm

deep-learning

pytorch

Results

dataset.py

Import
from shapely.geometry import Polygon
from torch.utils.data import Dataset
from PIL import Image
from torchvision import transforms
import torch
import os
import numpy as np
import math
import cv2
CustomDataset
class CustomDataset(Dataset):
    def __init__(self, img_path, gt_path, scale=0.25, length=512):
        super(CustomDataset, self).__init__()
        self.img_files = []
        for img_file in sorted(os.listdir(img_path)):
            if img_file.endswith(".jpg") or img_file.endswith(".png"):
                self.img_files.append(os.path.join(img_path, img_file))

        self.gt_files = []
        for gt_file in sorted(os.listdir(gt_path)):
            if gt_file.endswith(".txt"):
                self.gt_files.append(os.path.join(gt_path, gt_file))

        self.scale = scale
        self.length = length

    def __getitem__(self, index):
        with open(self.gt_files[index], 'r', encoding="utf-8") as f:
            lines = f.readlines()
        vertices, labels = extract_vertices(lines)

        img = Image.open(self.img_files[index])
        img, vertices = adjust_height(img, vertices)
        img, vertices = rotate_img(img, vertices)
        img, vertices = crop_img(img, vertices, labels, self.length)
        transform = transforms.Compose([transforms.ColorJitter(0.5, 0.5, 0.5, 0.25),
                                        transforms.ToTensor(),
                                        transforms.Normalize(mean=(0.5, 0.5, 0.5), std=(0.5, 0.5, 0.5))])

        score_map, geo_map, ignored_map = get_score_geo(img, vertices, labels, self.scale, self.length)
        return transform(img), score_map, geo_map, ignored_map

    def __len__(self):
        return len(self.img_files)
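
A minimal usage sketch, assuming the dataset/images and dataset/annotations folders that train.py below points at:

trainset = CustomDataset('dataset/images', 'dataset/annotations')
img, score_map, geo_map, ignored_map = trainset[0]
print(img.shape)          # torch.Size([3, 512, 512])
print(score_map.shape)    # torch.Size([1, 128, 128]) -- length * scale = 512 * 0.25
print(geo_map.shape)      # torch.Size([5, 128, 128]) -- 4 distances + 1 angle
print(ignored_map.shape)  # torch.Size([1, 128, 128])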
extract_vertices
def extract_vertices(lines):
    '''extract vertices info from txt lines
    Input:
            lines   : list of string info
    Output:
            vertices: vertices of text regions <numpy.ndarray, (n,8)>
            labels  : 1->valid, 0->ignore, <numpy.ndarray, (n,)>
    '''
    labels = []
    vertices = []
    for line in lines:
        vertices.append(list(map(int, line.rstrip('\n').lstrip('\ufeff').split(',')[:8])))
        label = 0 if '###' in line else 1
        labels.append(label)
    return np.array(vertices), np.array(labels)
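
A quick sanity check, using two hypothetical annotation lines in the ICDAR2015 style this parser expects (eight comma-separated integer coordinates, then the transcription, with '###' marking regions to ignore):

sample_lines = ["377,117,463,117,465,130,378,130,Genaxis\n",
                "374,155,409,155,409,170,374,170,###\n"]
vertices, labels = extract_vertices(sample_lines)
print(vertices.shape)  # (2, 8)
print(labels)          # [1 0] -> the second region is ignored during training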
adjust_height
def adjust_height(img, vertices, ratio=0.2):
    '''randomly adjust the image height to augment data
    Input:
            img         : PIL Image
            vertices    : vertices of text regions <numpy.ndarray, (n,8)>
            ratio       : height changes in [0.8, 1.2]
    Output:
            img         : adjusted PIL Image
            new_vertices: adjusted vertices
    '''
    ratio_h = 1 + ratio * (np.random.rand() * 2 - 1)
    old_h = img.height
    new_h = int(np.around(old_h * ratio_h))
    img = img.resize((img.width, new_h), Image.BILINEAR)  # PIL resize expects (width, height)

    new_vertices = vertices.copy()
    if vertices.size > 0:
        new_vertices[:, [1, 3, 5, 7]] = vertices[:, [1, 3, 5, 7]] * (new_h / old_h)
    return img, new_vertices
rotate_img
def rotate_img(img, vertices, angle_range=10):
    '''randomly rotate the image within [-10, 10] degrees to augment data
    Input:
            img         : PIL Image
            vertices    : vertices of text regions <numpy.ndarray, (n,8)>
            angle_range : rotate range
    Output:
            img         : rotated PIL Image
            new_vertices: rotated vertices
    '''
    center_x = (img.width - 1) / 2
    center_y = (img.height - 1) / 2
    angle = angle_range * (np.random.rand() * 2 - 1)  # from -10 to 10
    img = img.rotate(angle, Image.BILINEAR)             # PIL api
    new_vertices = np.zeros(vertices.shape)
    for i, vertice in enumerate(vertices):
        new_vertices[i, :] = rotate_vertices(vertice, -angle / 180 * math.pi, np.array([[center_x], [center_y]]))
    return img, new_vertices
rotate_vertices
def rotate_vertices(vertices, theta, anchor=None):
    '''rotate vertices around anchor
    Input:
            vertices: vertices of text region <numpy.ndarray, (8,)>
            theta   : angle in radian measure
            anchor  : fixed position during rotation
    Output:
            rotated vertices <numpy.ndarray, (8,)>
    '''
    v = vertices.reshape((4, 2)).T
    if anchor is None:
        anchor = v[:, :1]
    rotate_mat = get_rotate_mat(theta)
    res = np.dot(rotate_mat, v - anchor)
    return (res + anchor).T.reshape(-1)
get_rotate_mat
def get_rotate_mat(theta):
    '''positive theta means clockwise rotation in image coordinates (y points down)'''
    return np.array([[math.cos(theta), -math.sin(theta)], [math.sin(theta), math.cos(theta)]])
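
Since image coordinates put y pointing down, this matrix turns points clockwise on screen for positive theta. A one-line check of that convention:

point = np.array([[1.0], [0.0]])
print(np.dot(get_rotate_mat(math.pi / 2), point).round(6))  # [[0.], [1.]] -- +x rotates onto +y (downward)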
crop_img
def crop_img(img, vertices, labels, length):
    '''crop image patches for batching and augmentation
    Input:
            img         : PIL Image
            vertices    : vertices of text regions <numpy.ndarray, (n,8)>
            labels      : 1->valid, 0->ignore, <numpy.ndarray, (n,)>
            length      : length of cropped image region
    Output:
            region      : cropped image region
            new_vertices: new vertices in cropped region
    '''
    h, w = img.height, img.width
    # confirm the shortest side of image >= length
    if h >= w and w < length:
        img = img.resize((length, int(h * length / w)), Image.BILINEAR)
    elif h < w and h < length:
        img = img.resize((int(w * length / h), length), Image.BILINEAR)
    ratio_w = img.width / w
    ratio_h = img.height / h
    assert(ratio_w >= 1 and ratio_h >= 1)

    new_vertices = np.zeros(vertices.shape)
    if vertices.size > 0:
        new_vertices[:, [0, 2, 4, 6]] = vertices[:, [0, 2, 4, 6]] * ratio_w
        new_vertices[:, [1, 3, 5, 7]] = vertices[:, [1, 3, 5, 7]] * ratio_h

    # find random position
    remain_h = img.height - length
    remain_w = img.width - length
    flag = True
    cnt = 0
    while flag and cnt < 1000:
        cnt += 1
        start_w = int(np.random.rand() * remain_w)
        start_h = int(np.random.rand() * remain_h)
        flag = is_cross_text([start_w, start_h], length, new_vertices[labels == 1, :])
    box = (start_w, start_h, start_w + length, start_h + length)
    region = img.crop(box)
    if new_vertices.size == 0:
        return region, new_vertices

    new_vertices[:, [0, 2, 4, 6]] -= start_w
    new_vertices[:, [1, 3, 5, 7]] -= start_h
    return region, new_vertices
is_cross_text
def is_cross_text(start_loc, length, vertices):
    '''check if the crop image crosses text regions
    Input:
            start_loc: left-top position
            length   : length of crop image
            vertices : vertices of text regions <numpy.ndarray, (n,8)>
    Output:
            True if crop image crosses text region
    '''
    if vertices.size == 0:
        return False
    start_w, start_h = start_loc
    a = np.array([start_w, start_h, start_w + length, start_h,
                  start_w + length, start_h + length, start_w, start_h + length]).reshape((4, 2))
    p1 = Polygon(a).convex_hull
    epsilon = 1e-6
    for vertice in vertices:
        p2 = Polygon(vertice.reshape((4, 2))).convex_hull
        inter = p1.intersection(p2).area
        if 0.01 <= inter / (p2.area + epsilon) <= 0.99:
            return True
    return False
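
To illustrate the 0.01-0.99 window with hypothetical boxes: a crop is rejected only when it slices through a text region; regions fully inside (or fully outside) the crop are fine:

box_cut = np.array([[50, 10, 150, 10, 150, 40, 50, 40]], dtype=float)
box_inside = np.array([[10, 10, 40, 10, 40, 40, 10, 40]], dtype=float)
print(is_cross_text([0, 0], 100, box_cut))     # True  -> the 100x100 crop cuts the box in half
print(is_cross_text([0, 0], 100, box_inside))  # False -> the box is fully contained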
get_score_geo
def get_score_geo(img, vertices, labels, scale, length):
    '''generate score gt and geometry gt
    Input:
            img     : PIL Image
            vertices: vertices of text regions <numpy.ndarray, (n,8)>
            labels  : 1->valid, 0->ignore, <numpy.ndarray, (n,)>
            scale   : feature map / image
            length  : image length
    Output:
            score gt, geo gt, ignored
    '''
    score_map = np.zeros((int(img.height * scale), int(img.width * scale), 1), np.float32)
    geo_map = np.zeros((int(img.height * scale), int(img.width * scale), 5), np.float32)
    ignored_map = np.zeros((int(img.height * scale), int(img.width * scale), 1), np.float32)

    index = np.arange(0, length, int(1 / scale))
    index_x, index_y = np.meshgrid(index, index)
    ignored_polys = []
    polys = []

    for i, vertice in enumerate(vertices):
        if labels[i] == 0:
            ignored_polys.append(np.around(scale * vertice.reshape((4, 2))).astype(np.int32))
            continue

        poly = np.around(scale * shrink_poly(vertice).reshape((4, 2))).astype(np.int32)  # scaled & shrinked
        polys.append(poly)
        temp_mask = np.zeros(score_map.shape[:-1], np.float32)
        cv2.fillPoly(temp_mask, [poly], 1)

        theta = find_min_rect_angle(vertice)
        rotate_mat = get_rotate_mat(theta)

        rotated_vertices = rotate_vertices(vertice, theta)
        x_min, x_max, y_min, y_max = get_boundary(rotated_vertices)
        rotated_x, rotated_y = rotate_all_pixels(rotate_mat, vertice[0], vertice[1], length)

        # For a pixel p inside the text region, its distance to the top edge of the
        # min-area rectangle is r(p)_y - y_min, where r is the rotation (anchored at
        # the first vertex) that axis-aligns the box. A pixel lies inside the rotated
        # box only if this value is non-negative, so negative distances are clipped.

        # the geometry gt channels are: top, bottom, left, right, angle
        d1 = rotated_y - y_min
        d1[d1 < 0] = 0
        d2 = y_max - rotated_y
        d2[d2 < 0] = 0
        d3 = rotated_x - x_min
        d3[d3 < 0] = 0
        d4 = x_max - rotated_x
        d4[d4 < 0] = 0
        geo_map[:, :, 0] += d1[index_y, index_x] * temp_mask
        geo_map[:, :, 1] += d2[index_y, index_x] * temp_mask
        geo_map[:, :, 2] += d3[index_y, index_x] * temp_mask
        geo_map[:, :, 3] += d4[index_y, index_x] * temp_mask
        geo_map[:, :, 4] += theta * temp_mask

    cv2.fillPoly(ignored_map, ignored_polys, 1)
    cv2.fillPoly(score_map, polys, 1)
    return torch.Tensor(score_map).permute(2, 0, 1), torch.Tensor(geo_map).permute(2, 0, 1), torch.Tensor(ignored_map).permute(2, 0, 1)
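
A shape check with a single hypothetical box, using the scale=0.25 and length=512 defaults from CustomDataset:

img = Image.new('RGB', (512, 512))
verts = np.array([[100., 100., 200., 100., 200., 150., 100., 150.]])
score, geo, ignored = get_score_geo(img, verts, np.array([1]), scale=0.25, length=512)
print(score.shape, geo.shape, ignored.shape)
# torch.Size([1, 128, 128]) torch.Size([5, 128, 128]) torch.Size([1, 128, 128])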
shrink_poly
def shrink_poly(vertices, coef=0.3):
    '''shrink the text region
    Input:
            vertices: vertices of text region <numpy.ndarray, (8,)>
            coef    : shrink ratio in paper
    Output:
            v       : vertices of shrunk text region <numpy.ndarray, (8,)>
    '''
    x1, y1, x2, y2, x3, y3, x4, y4 = vertices
    r1 = min(cal_distance(x1, y1, x2, y2), cal_distance(x1, y1, x4, y4))
    r2 = min(cal_distance(x2, y2, x1, y1), cal_distance(x2, y2, x3, y3))
    r3 = min(cal_distance(x3, y3, x2, y2), cal_distance(x3, y3, x4, y4))
    r4 = min(cal_distance(x4, y4, x1, y1), cal_distance(x4, y4, x3, y3))
    r = [r1, r2, r3, r4]

    # obtain offset to perform move_points() automatically
    if cal_distance(x1, y1, x2, y2) + cal_distance(x3, y3, x4, y4) > \
            cal_distance(x2, y2, x3, y3) + cal_distance(x1, y1, x4, y4):
        offset = 0  # two longer edges are (x1y1-x2y2) & (x3y3-x4y4)
    else:
        offset = 1  # two longer edges are (x2y2-x3y3) & (x4y4-x1y1)

    # The movement is always parallel to the edges: each move_points call pushes
    # two adjacent vertices towards each other, and every vertex is adjusted
    # twice (once per incident edge) so that it moves towards the center.
    v = vertices.copy()
    v = move_points(v, 0 + offset, 1 + offset, r, coef)
    v = move_points(v, 2 + offset, 3 + offset, r, coef)
    v = move_points(v, 1 + offset, 2 + offset, r, coef)
    v = move_points(v, 3 + offset, 4 + offset, r, coef)
    return v
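
A worked example on an axis-aligned 100x40 box: every r_i is 40 (the shorter incident edge), so each vertex moves inward by coef * r_i = 12 pixels along both of its edges:

box = np.array([0., 0., 100., 0., 100., 40., 0., 40.])
print(shrink_poly(box))  # [12. 12. 88. 12. 88. 28. 12. 28.]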
find_min_rect_angle
def find_min_rect_angle(vertices):
    '''find the best angle to rotate poly and obtain min rectangle
    Input:
            vertices: vertices of text region <numpy.ndarray, (8,)>
    Output:
            the best angle <radian measure>
    '''
    angle_interval = 1
    angle_list = list(range(-90, 90, angle_interval))
    area_list = []
    for theta in angle_list:
        rotated = rotate_vertices(vertices, theta / 180 * math.pi)
        x1, y1, x2, y2, x3, y3, x4, y4 = rotated
        temp_area = (max(x1, x2, x3, x4) - min(x1, x2, x3, x4)) * \
            (max(y1, y2, y3, y4) - min(y1, y2, y3, y4))
        area_list.append(temp_area)

    sorted_area_index = sorted(list(range(len(area_list))), key=lambda k: area_list[k])
    min_error = float('inf')
    best_index = -1
    rank_num = 10
    # find the best angle with correct orientation
    for index in sorted_area_index[:rank_num]:
        rotated = rotate_vertices(vertices, angle_list[index] / 180 * math.pi)
        temp_error = cal_error(rotated)
        if temp_error < min_error:
            min_error = temp_error
            best_index = index
    return angle_list[best_index] / 180 * math.pi
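
A quick check with a hypothetical box tilted by 10 degrees: the recovered angle is the rotation that re-aligns it with the axes while keeping the left-top vertex first:

box = np.array([0., 0., 100., 0., 100., 40., 0., 40.])
tilted = rotate_vertices(box, 10 / 180 * math.pi)
print(round(find_min_rect_angle(tilted) * 180 / math.pi))  # -10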
cal_distance
def cal_distance(x1, y1, x2, y2):
    '''calculate the Euclidean distance'''
    return math.sqrt((x1 - x2)**2 + (y1 - y2)**2)
get_boundary
def get_boundary(vertices):
    '''get the tight boundary around given vertices
    Input:
            vertices: vertices of text region <numpy.ndarray, (8,)>
    Output:
            the boundary
    '''
    x1, y1, x2, y2, x3, y3, x4, y4 = vertices
    x_min = min(x1, x2, x3, x4)
    x_max = max(x1, x2, x3, x4)
    y_min = min(y1, y2, y3, y4)
    y_max = max(y1, y2, y3, y4)
    return x_min, x_max, y_min, y_max
rotate_all_pixels
def rotate_all_pixels(rotate_mat, anchor_x, anchor_y, length):
    '''get rotated locations of all pixels for next stages
    Input:
            rotate_mat: rotation matrix
            anchor_x  : fixed x position
            anchor_y  : fixed y position
            length    : length of image
    Output:
            rotated_x : rotated x positions <numpy.ndarray, (length,length)>
            rotated_y : rotated y positions <numpy.ndarray, (length,length)>
    '''
    x = np.arange(length)
    y = np.arange(length)
    x, y = np.meshgrid(x, y)
    x_lin = x.reshape((1, x.size))
    y_lin = y.reshape((1, y.size))
    coord_mat = np.concatenate((x_lin, y_lin), 0)
    rotated_coord = np.dot(rotate_mat, coord_mat - np.array([[anchor_x], [anchor_y]])) + \
        np.array([[anchor_x], [anchor_y]])
    rotated_x = rotated_coord[0, :].reshape(x.shape)
    rotated_y = rotated_coord[1, :].reshape(y.shape)
    return rotated_x, rotated_y
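
An identity check: with theta = 0 the pixel grid comes back unchanged, and the anchor only matters once the angle is non-zero:

rx, ry = rotate_all_pixels(get_rotate_mat(0), 0, 0, 3)
print(rx)  # every row is [0. 1. 2.]
print(ry)  # every column is [0. 1. 2.]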
move_points
def move_points(vertices, index1, index2, r, coef):
    '''move the two points to shrink edge
    Input:
            vertices: vertices of text region <numpy.ndarray, (8,)>
            index1  : offset of point1
            index2  : offset of point2
            r       : [r1, r2, r3, r4] in paper
            coef    : shrink ratio in paper
    Output:
            vertices: vertices where one edge has been shrunk
    '''
    index1 = index1 % 4
    index2 = index2 % 4
    x1_index = index1 * 2 + 0
    y1_index = index1 * 2 + 1
    x2_index = index2 * 2 + 0
    y2_index = index2 * 2 + 1

    r1 = r[index1]
    r2 = r[index2]
    length_x = vertices[x1_index] - vertices[x2_index]
    length_y = vertices[y1_index] - vertices[y2_index]
    length = cal_distance(vertices[x1_index], vertices[y1_index], vertices[x2_index], vertices[y2_index])
    if length > 1:
        ratio = (r1 * coef) / length
        vertices[x1_index] += ratio * (-length_x)
        vertices[y1_index] += ratio * (-length_y)
        ratio = (r2 * coef) / length
        vertices[x2_index] += ratio * length_x
        vertices[y2_index] += ratio * length_y
    return vertices
cal_error
def cal_error(vertices):
    '''default orientation is x1y1 : left-top, x2y2 : right-top, x3y3 : right-bot, x4y4 : left-bot
    calculate the difference between the vertices orientation and default orientation
    Input:
            vertices: vertices of text region <numpy.ndarray, (8,)>
    Output:
            err     : difference measure
    '''
    x_min, x_max, y_min, y_max = get_boundary(vertices)
    x1, y1, x2, y2, x3, y3, x4, y4 = vertices
    err = cal_distance(x1, y1, x_min, y_min) + cal_distance(x2, y2, x_max, y_min) + \
        cal_distance(x3, y3, x_max, y_max) + cal_distance(x4, y4, x_min, y_max)
    return err

losses.py

Import
import torch
import torch.nn as nn
get_dice_loss
def get_dice_loss(gt_score, pred_score):
    inter = torch.sum(gt_score * pred_score)
    union = torch.sum(gt_score) + torch.sum(pred_score) + 1e-5
    return 1. - (2 * inter / union)
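
Dice loss is ~0 for a perfect match and 1 for disjoint masks, e.g.:

gt = torch.tensor([1., 1., 0., 0.])
print(get_dice_loss(gt, gt).item())                              # ~0.0
print(get_dice_loss(gt, torch.tensor([0., 0., 1., 1.])).item())  # 1.0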
get_geo_loss
def get_geo_loss(gt_geo, pred_geo):
    d1_gt, d2_gt, d3_gt, d4_gt, angle_gt = torch.split(gt_geo, 1, 1)
    d1_pred, d2_pred, d3_pred, d4_pred, angle_pred = torch.split(pred_geo, 1, 1)
    area_gt = (d1_gt + d2_gt) * (d3_gt + d4_gt)
    area_pred = (d1_pred + d2_pred) * (d3_pred + d4_pred)
    w_union = torch.min(d3_gt, d3_pred) + torch.min(d4_gt, d4_pred)
    h_union = torch.min(d1_gt, d1_pred) + torch.min(d2_gt, d2_pred)
    area_intersect = w_union * h_union
    area_union = area_gt + area_pred - area_intersect
    iou_loss_map = -torch.log((area_intersect + 1.0) / (area_union + 1.0))
    angle_loss_map = 1 - torch.cos(angle_pred - angle_gt)
    return iou_loss_map, angle_loss_map
Loss
class Loss(nn.Module):
    def __init__(self, weight_angle=10):
        super(Loss, self).__init__()
        self.weight_angle = weight_angle

    def forward(self, gt_score, pred_score, gt_geo, pred_geo, ignored_map):
        if torch.sum(gt_score) < 1:
            return torch.sum(pred_score + pred_geo) * 0

        classify_loss = get_dice_loss(gt_score, pred_score * (1 - ignored_map))
        iou_loss_map, angle_loss_map = get_geo_loss(gt_geo, pred_geo)

        angle_loss = torch.sum(angle_loss_map * gt_score) / torch.sum(gt_score)
        iou_loss = torch.sum(iou_loss_map * gt_score) / torch.sum(gt_score)
        geo_loss = self.weight_angle * angle_loss + iou_loss
        return geo_loss + classify_loss
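
A smoke test with random maps at the 128x128 output resolution produced by get_score_geo (hypothetical batch of 2):

criterion = Loss()
gt_score = torch.randint(0, 2, (2, 1, 128, 128)).float()
pred_score = torch.rand(2, 1, 128, 128)
gt_geo = torch.rand(2, 5, 128, 128)
pred_geo = torch.rand(2, 5, 128, 128)
ignored_map = torch.zeros(2, 1, 128, 128)
print(criterion(gt_score, pred_score, gt_geo, pred_geo, ignored_map))  # a scalar tensor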

train.py

Import
import torch
from torch.utils import data
from torch import nn
from torch.optim import lr_scheduler
from dataset import CustomDataset
from detect import performance_check
from models import EAST
from losses import Loss
from tqdm import tqdm
from device import device
from utils import ConsoleLog
import os
import time
train
console_log = ConsoleLog(lines_up_on_end=1)
def train(train_img_path, train_gt_path, pths_path, batch_size, lr, num_workers, epoch_iter, interval):
    file_num = len(os.listdir(train_img_path))
    trainset = CustomDataset(train_img_path, train_gt_path)
    train_loader = data.DataLoader(trainset,
                                   batch_size=batch_size,
                                   shuffle=True,
                                   num_workers=num_workers,
                                   drop_last=True)

    criterion = Loss()
    model = EAST()
    data_parallel = False

    if torch.cuda.device_count() > 1:
        model = nn.DataParallel(model)
        data_parallel = True

    model.to(device)
    optimizer = torch.optim.Adam(model.parameters(), lr=lr)
    scheduler = lr_scheduler.MultiStepLR(optimizer,
                                         milestones=[epoch_iter // 2],
                                         gamma=0.1)

    for epoch in range(epoch_iter):
        model.train()
        epoch_loss = 0
        epoch_time = time.time()

        for batch, (img, gt_score, gt_geo, ignored_map) in enumerate(tqdm(
            train_loader,
            total=len(trainset) // batch_size,
            bar_format="{desc}: {percentage:.1f}%|{bar:15}| {n}/{total_fmt} [{elapsed}, {rate_fmt}{postfix}]"
        )):
            start_time = time.time()

            img = img.to(device)
            gt_score = gt_score.to(device)
            gt_geo = gt_geo.to(device)
            ignored_map = ignored_map.to(device)

            pred_score, pred_geo = model(img)

            loss = criterion(gt_score, pred_score, gt_geo, pred_geo, ignored_map)

            epoch_loss += loss.item()
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()

            if (batch + 1) % interval == 0:
                performance_check(model, save_image_path="results/epoch_{}_batch_{}.jpg".format(epoch, batch + 1))

            console_log.print(
                'Epoch is [{}/{}], mini-batch is [{}/{}], time consumption is {:.8f}, batch_loss is {:.8f}'.format(
                    epoch + 1, epoch_iter, batch + 1, int(file_num / batch_size), time.time() - start_time, loss.item()),
                is_key_value=False
            )

        scheduler.step()  # step the learning-rate schedule once per epoch

        if (epoch + 1) % interval == 0:
            state_dict = model.module.state_dict() if data_parallel else model.state_dict()
            torch.save(state_dict, os.path.join(pths_path, 'model_epoch_{}.pth'.format(epoch + 1)))


if __name__ == '__main__':
    train_img_path = os.path.abspath('dataset/images')
    train_gt_path = os.path.abspath('dataset/annotations')
    pths_path = './pths'
    batch_size = 24
    lr = 1e-3
    num_workers = 4
    epoch_iter = 600
    save_interval = 5
    train(train_img_path, train_gt_path, pths_path, batch_size, lr, num_workers, epoch_iter, save_interval)

detect.py

Import
from torchvision import transforms
from PIL import Image, ImageDraw
from models import EAST
from dataset import get_rotate_mat
from utils import nms_locality
from device import device

import config
import torch
import os
import numpy as np
import random
resize_img
def resize_img(img):
    '''resize the image so both sides are divisible by 32
    '''
    w, h = img.size
    resize_w = w
    resize_h = h

    resize_h = resize_h if resize_h % 32 == 0 else int(resize_h / 32) * 32
    resize_w = resize_w if resize_w % 32 == 0 else int(resize_w / 32) * 32
    img = img.resize((resize_w, resize_h), Image.BILINEAR)
    ratio_h = resize_h / h
    ratio_w = resize_w / w

    return img, ratio_h, ratio_w
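
Both sides are floored to the nearest multiple of 32 so the network's downsampling strides divide the input evenly, e.g.:

img = Image.new('RGB', (1280, 723))
resized, ratio_h, ratio_w = resize_img(img)
print(resized.size, ratio_h, ratio_w)  # (1280, 704) 0.9737... 1.0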
load_pil
def load_pil(img):
    '''convert PIL Image to torch.Tensor
    '''
    t = transforms.Compose([transforms.ToTensor(), transforms.Normalize(mean=(0.5, 0.5, 0.5), std=(0.5, 0.5, 0.5))])
    return t(img).unsqueeze(0)
is_valid_poly
def is_valid_poly(res, score_shape, scale):
    '''check if the poly is within the image scope
    Input:
            res        : restored poly in original image
            score_shape: score map shape
            scale      : feature map -> image
    Output:
            True if valid
    '''
    cnt = 0
    for i in range(res.shape[1]):
        if res[0, i] < 0 or res[0, i] >= score_shape[1] * scale or \
                res[1, i] < 0 or res[1, i] >= score_shape[0] * scale:
            cnt += 1
    return cnt <= 1
restore_polys
def restore_polys(valid_pos, valid_geo, score_shape, scale=4):
    '''restore polys from feature maps in given positions
    Input:
            valid_pos  : potential text positions <numpy.ndarray, (n,2)>
            valid_geo  : geometry in valid_pos <numpy.ndarray, (5,n)>
            score_shape: shape of score map
            scale      : image / feature map
    Output:
            restored polys <numpy.ndarray, (n,8)>, index
    '''
    polys = []
    index = []
    valid_pos *= scale
    d = valid_geo[:4, :]  # 4 x N
    angle = valid_geo[4, :]  # N,

    for i in range(valid_pos.shape[0]):
        x = valid_pos[i, 0]
        y = valid_pos[i, 1]
        y_min = y - d[0, i]
        y_max = y + d[1, i]
        x_min = x - d[2, i]
        x_max = x + d[3, i]
        rotate_mat = get_rotate_mat(-angle[i])

        temp_x = np.array([[x_min, x_max, x_max, x_min]]) - x
        temp_y = np.array([[y_min, y_min, y_max, y_max]]) - y
        coordinates = np.concatenate((temp_x, temp_y), axis=0)
        res = np.dot(rotate_mat, coordinates)
        res[0, :] += x
        res[1, :] += y

        if is_valid_poly(res, score_shape, scale):
            index.append(i)
            polys.append([res[0, 0], res[1, 0], res[0, 1], res[1, 1], res[0, 2], res[1, 2], res[0, 3], res[1, 3]])
    return np.array(polys), index
get_boxes
def get_boxes(score, geo, score_thresh=config.detection_score_threshold, nms_thresh=0.2):
    '''get boxes from feature map
    Input:
            score       : score map from model <numpy.ndarray, (1,row,col)>
            geo         : geo map from model <numpy.ndarray, (5,row,col)>
            score_thresh: threshold to segment score map
            nms_thresh  : threshold in nms
    Output:
            boxes       : final polys <numpy.ndarray, (n,9)>
    '''
    score = score[0, :, :]
    xy_text = np.argwhere(score > score_thresh)  # n x 2, format is [r, c]
    if xy_text.size == 0:
        return None

    xy_text = xy_text[np.argsort(xy_text[:, 0])]
    valid_pos = xy_text[:, ::-1].copy()  # n x 2, [x, y]
    # the ::-1 swaps each (row, col) pair into (x, y) image coordinates
    valid_geo = geo[:, xy_text[:, 0], xy_text[:, 1]]  # 5 x n
    # valid_geo columns are gathered in the same order as valid_pos
    polys_restored, index = restore_polys(valid_pos, valid_geo, score.shape)
    if polys_restored.size == 0:
        return None

    boxes = np.zeros((polys_restored.shape[0], 9), dtype=np.float32)
    boxes[:, :8] = polys_restored
    boxes[:, 8] = score[xy_text[index, 0], xy_text[index, 1]]
    boxes = nms_locality(boxes.astype('float32'), nms_thresh)
    return boxes
adjust_ratio
def adjust_ratio(boxes, ratio_w, ratio_h):
    '''refine boxes
    Input:
            boxes  : detected polys <numpy.ndarray, (n,9)>
            ratio_w: ratio of width
            ratio_h: ratio of height
    Output:
            refined boxes
    '''
    if boxes is None or boxes.size == 0:
        return None
    boxes[:, [0, 2, 4, 6]] /= ratio_w
    boxes[:, [1, 3, 5, 7]] /= ratio_h
    return np.around(boxes)
detect
def detect(img, model, device):
    '''detect text regions of img using model
    Input:
            img   : PIL Image
            model : detection model
            device: gpu if gpu is available
    Output:
            detected polys
    '''
    img, ratio_h, ratio_w = resize_img(img)
    with torch.no_grad():
        score, geo = model(load_pil(img).to(device))
    boxes = get_boxes(score.squeeze(0).cpu().numpy(), geo.squeeze(0).cpu().numpy())
    return adjust_ratio(boxes, ratio_w, ratio_h)
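
Typical usage, assuming a checkpoint saved by train.py (the file names here are hypothetical):

model = EAST().to(device)
model.load_state_dict(torch.load('pths/model_epoch_600.pth', map_location=device))
model.eval()
boxes = detect(Image.open('dataset/images/img_1.jpg'), model, device)
print(None if boxes is None else boxes.shape)  # (n, 9): 8 coordinates plus a score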
plot_boxes
def plot_boxes(img, boxes):
    '''plot boxes on image
    '''
    if boxes is None:
        return img

    draw = ImageDraw.Draw(img)
    for box in boxes:
        draw.polygon([box[0], box[1], box[2], box[3], box[4], box[5], box[6], box[7]], outline=(0, 255, 0))
    return img
detect_dataset
def detect_dataset(model, device, test_img_path, submit_path):
    '''detection on whole dataset, save .txt results in submit_path
    Input:
            model        : detection model
            device       : gpu if gpu is available
            test_img_path: dataset path
            submit_path  : submit result for evaluation
    '''
    img_files = os.listdir(test_img_path)
    img_files = sorted([os.path.join(test_img_path, img_file) for img_file in img_files])

    for i, img_file in enumerate(img_files):
        print('evaluating {} image'.format(i), end='\r')
        boxes = detect(Image.open(img_file), model, device)
        seq = []
        if boxes is not None:
            seq.extend([','.join([str(int(b)) for b in box[:-1]]) + '\n' for box in boxes])
        with open(os.path.join(submit_path, 'res_' + os.path.splitext(os.path.basename(img_file))[0] + '.txt'), 'w') as f:
            f.writelines(seq)
performance_check
def performance_check(model, save_image_path):
    model.eval()
    images = os.listdir("dataset/images")
    random.shuffle(images)
    img = Image.open("dataset/images/{}".format(images[0]))
    boxes = detect(img, model, device)
    plot_img = plot_boxes(img, boxes)
    plot_img.save(save_image_path)
    plot_img.save("results/latest_output.jpg")
    model.train()

device

import torch
device = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

utils

Import
from pydash.objects import get, set_
from shapely.geometry import Polygon
import numpy as np
iou
def iou(g, p):
    g = Polygon(g[:8].reshape((4, 2)))
    p = Polygon(p[:8].reshape((4, 2)))
    if not g.is_valid or not p.is_valid:
        return 0
    inter = g.intersection(p).area
    union = g.area + p.area - inter
    if union == 0:
        return 0
    else:
        return inter / union
weighted_merge
def weighted_merge(g, p):
    g[:8] = (g[8] * g[:8] + p[8] * p[:8]) / (g[8] + p[8])
    g[8] = (g[8] + p[8])
    return g
standard_nms
def standard_nms(S, thres):
    order = np.argsort(S[:, 8])[::-1]
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)
        ious = np.array([iou(S[i], S[t]) for t in order[1:]])

        inds = np.where(ious <= thres)[0]
        # ious was computed over order[1:], so shift indices by 1
        order = order[inds + 1]

    return S[keep]
nms_locality
def nms_locality(polys, thres=0.3):
    '''
    locality aware nms of EAST
    :param polys: a N*9 numpy array. first 8 coordinates, then prob
    :return: boxes after nms
    '''
    S = []
    p = None
    for g in polys:
        if p is not None and iou(g, p) > thres:
            p = weighted_merge(g, p)
        else:
            if p is not None:
                S.append(p)
            p = g
    if p is not None:
        S.append(p)

    if len(S) == 0:
        return np.array([])
    return standard_nms(np.array(S), thres)
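
Unlike standard NMS, nearly identical neighbours are merged with score-weighted coordinates instead of being discarded. Two hypothetical boxes:

a = np.array([0, 0, 100, 0, 100, 40, 0, 40, 0.9], dtype=np.float32)
b = np.array([2, 0, 102, 0, 102, 40, 2, 40, 0.8], dtype=np.float32)
merged = nms_locality(np.stack([a, b]))
print(merged.shape)   # (1, 9) -- merged into a single box
print(merged[0, 8])   # 1.7   -- scores are summed by weighted_merge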