Learning Korean with MLX feat. Mistral 7B

less than 1 minute read

Goal

Train on Korean data with MLX
Build a text-generating AI that writes wuxia novels in the style of Jin Yong

Result: Failure

But it did get a little better at Korean

Training Process

Machine: M1 Pro
Framework: MLX
Training model: Mistral 7B
Training data: The Return of the Condor Heroes, converted into JSONL format myself
Time spent: 1 hour
Training method: LoRa
lora script: pythyon lora.py –model ./mlx_model –train –iters 600

Training Results

Mistral 7B base model: barely speaks Korean

fail

Lora: it does respond

half

python lora.py –model ./mlx_model –adapter-file ./adapters.npz –max-tokens 50 –prompt “서독 구양봉이 사용하는 무공의 위력은 “

Presumed Cause of Failure

Insufficient Korean training of the model: a model trained on the original English text (Will Durant’s The Story of Civilization, Volume 13, Greece-Rome model training) works well

Lack of data: 8 books’ worth of data seems insufficient

Future Tasks

Collect high-quality data for Korean training

Gain a deeper understanding of MLX

Review successful Korean training examples

20240130

Share on

Twitter Facebook LinkedIn

아름다움의 진화

less than 1 minute read

공교롭게도 이 책 안에서 최근에 읽은 책 2권이 언급되었다. 진화생물학에서의 열렬한 적응주의자로서 리처드 도킨스에 대한 언급은 놀랍지 않았다. 하지만 로버트 실러에 대한 언급은 뜻밖이었다. 저자와 함께 예일대 교수를 하고 있는 이웃인지라 함께 자주 점심을 먹는다고 한다.

The Evolution of Beauty

1 minute read

By coincidence, two books I had recently read were mentioned in this one. As a fervent adaptationist in evolutionary biology, Richard Dawkins’s appearance wa...

병신과 머저리

less than 1 minute read

영화 버닝과 유사하지 않냐는 평가에 궁금하여 읽었다. 과연 비슷한 점이 있다. 두 작품 모두 현실과 꿈이 구분이 되지 않아 뒤죽박죽이 되는데, 이 간극을 해소할 길이 없다. 독자는 흩어진 현실과 환상을 뒤적여 원하는 방향으로 취사선택 하는 수 밖에 없다. 하지만 버닝에서는 이제야 ...

The wounded

less than 1 minute read

I picked up this novel after hearing comparisons to the film Burning, curious to see whether the resemblance was real. It turns out that the two works do sha...

YoungSeon.Ahn

Learning Korean with MLX feat. Mistral 7B

Goal

Result: Failure

Training Process

Training Results

Mistral 7B base model: barely speaks Korean

Lora: it does respond

Presumed Cause of Failure

Future Tasks

Share on

Leave a comment

You may also enjoy

아름다움의 진화

The Evolution of Beauty

병신과 머저리

The wounded