🏷 Tag

llm · 205 topics

research (103)

2026 · Jun

2026 · May

05-31🔥🔥

parallax parameterized local linear attention

05-31🔥🔥

probe targeted fine tuning

05-30🔥

a moment of thanks for deepseek

05-29🔥🔥🔥

laguna m1 xs2

05-29🔥🔥

itbench aa frontier models score below 50 percent

05-29🔥🔥

2026 05 29 papers 2605.27375

05-25🔥🔥

nvidia nemotron labs diffusion

05-25🔥🔥

psibotai syndata

05-24🔥🔥🔥

cohere command a plus 218b moe

05-23🔥

2026 05 23 papers 2605.20189 solar lifelong learning

05-22🔥🔥

gemini system prompt leak

05-22🔥🔥🔥

google gemini 3 5 flash vs all

05-21🔥🔥

huggingface ettin reranker family release

05-21🔥🔥

internlm intern s2 preview 35b

05-20🔥🔥🔥

google gemini 3 5 flash agentic performance

05-20🔥🔥🔥

gemini 3 5 flash

05-19🔥🔥🔥

transformer scalability crisis

05-19🔥🔥🔥

nvidia nemotron personas korea

05-19🔥🔥🔥

qwen 3 7 dropped on qwen chat

05-17🔥🔥

llm architectures kv sharing mhc

05-17🔥🔥

arxiv llm error ban

05-16🔥🔥🔥

teichai deepseek v4 pro agent dataset

05-15🔥🔥🔥

inclusionai ring 2 6 1t

05-15🔥🔥

hermes agent reasoning traces

05-14🔥🔥🔥

mimo v2 5 pro opensource

05-14🔥🔥

modotte codex 2m thinking

05-13🔥🔥🔥

jina embeddings v5 omni

05-11🔥🔥

llms corrupt documents delegation

05-11🔥🔥

hy mt 1 5 1 8b 1 25bit quantization

05-11🔥🔥

hidream o1 image uit

05-10🔥🔥

teaching claude why

05-09🔥🔥

allen institute emo moe modularity

05-09🔥🔥

cybersecqwen 4b

05-09🔥🔥

ai2 emo moe

05-08🔥🔥

natural language autoencoders

05-08🔥🔥🔥

openai voice intelligence api

05-08🔥🔥🔥

gpt 5 5 and cyber trusted access

05-06🔥🔥🔥

gpt 5 5 instant system card

05-06🔥

microsoft nsdi 2026 advances

05-05🔥🔥

inclusionai ling 2 6 flash release

05-05🔥🔥

qwen3 6 27b dflash speculative decoding

05-05🔥🔥

autobe benchmark backend generation

05-04🔥🔥🔥

kimi k2 6 beats gpt 5 5 coding

05-04🔥🔥

harvard o1 er diagnosis

05-04🔥🔥

evolving deep learning optimizers

05-03🔥🔥

refusal in language models is mediated by a single direction

05-03🔥🔥

deepseek v4 flash

05-03🔥🔥

nvidia nemotron 3 nano omni 30b a3b reasoning bf16

05-03🔥🔥

unsloth qwen3 6 27b gguf

05-02🔥🔥

deepseek v4 series release

05-02🔥🔥

ai outperforms er doctors diagnostic cases

05-02🔥🔥

grok 4 3 benchmark performance

05-02🔥🔥

llm refusal single direction

05-02🔥🔥🔥

fineweb edu

05-02🔥🔥

xiaomimimo mimo v2 5

05-02🔥🔥

xiaomimimo mimo v2.5 pro

05-01🔥🔥

qwen3 6 27b uncensored hauhaucs aggressive

05-01🔥🔥

gpt 5 5 cyber capabilities

05-01🔥🔥

red teaming a network of agents

2026 · Apr

tools (55)

2026 · Jun

2026 · May

2026 · Apr