haiku os m1 mac boot
apple silicon local llm cost analysis
google magenta mrt2 low latency
basert best in class llm inference on apple silicon via native metal