.gitignore | ||
bun.lockb | ||
index.ts | ||
package.json | ||
README.md | ||
test.cpp | ||
tsconfig.json |
Encoding Fix Tool
Prepares source files for conversion of encoding from EUC-KR to UTF-8.
Background
Most files in the source were originally written using the EUC-KR encoding. This would be fine if only comments were using characters that only exist in that encoding.
However, the original devs used EUC-KR also in string literals, which in turn are sent to the client or localized directly on the server and act as a lookup key.
If we simply convert the whole file from EUC-KR to UTF-8, these lookups will break since not all references are server-side and we want to keep compatibility with existing systems (client, quests, etc).
Therefore, we convert characters that are not valid UTF-8 characters used in string literals to their byte's string representation.
We leave comments untouched in order to convert those in bulk with a iconv
find . -name '*.cpp' -exec iconv -f EUC-KR -t UTF-8//TRANSLIT -o {}_u {} \; -exec mv {}_u {} \;
Repeat for the desired file extensions.
Usage
To install dependencies:
bun install
To run:
bun run index.ts
This project was created using bun init
in bun v1.1.1. Bun is a fast all-in-one JavaScript runtime.