Skip to content

Takes .ass / .ssa Japanese subtitle files and creates a frequency list based on those words.

Notifications You must be signed in to change notification settings

NessDan/japanese-subs-to-word-frequency

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Get the most used Japanese words from subtitle files

Put your ass or ssa files in the subs/ directory and run node subparser.js and then node index.js.

Requirements

You'll need node.js and mecab. Also make sure to npm install.

Results

You'll get an output.txt files with frequencies. It will look like this:

の,9274
て,8090
は,7175
に,7050
た,6444
が,5248
だ,4955
を,4521
ない,3577
で,3483

About

Takes .ass / .ssa Japanese subtitle files and creates a frequency list based on those words.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published