The model used is a quantized version of `Llama-3-Taiwan-8B-Instruct`. More details can be found on the model's Hugging Face page: https://huggingface.co/yentinglin/Llama-3-Taiwan-8B-Instruct