Image→image ANN design opinions
Based on informal tinkering and reading. Happy to discuss, within reason.
1. QKV attention transformers work, but they are gross: O(n²) cost in the number of tokens, and no built-in shift equivariance once positional embeddings enter. They will not last. (See the cost sketch after this list.)
2. Most (not all!) learned convolutions are wasted and should be replaced by fixed bases or frames (e.g., DCT, Gabor, or wavelet filter banks), or at least by grouped convolutions. (See the fixed-basis sketch below.)
3. Diffusion and flow-matching training approaches are immature but way more elegant than direct x→y regression. (See the flow-matching sketch below.)
4. MMA regularization is good.
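
To make the complaint in item 1 concrete, here is a minimal sketch of single-head dot-product attention over flattened image patches; the shapes and weight names are illustrative, not from the post:

```python
import torch

def qkv_attention(x, w_q, w_k, w_v):
    # x: (n, d) -- n flattened patch tokens with d channels each
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = (q @ k.T) / k.shape[-1] ** 0.5  # (n, n): the O(n^2) term
    return torch.softmax(scores, dim=-1) @ v

n, d = 4096, 64  # a 64x64 image tokenized at patch size 1
x = torch.randn(n, d)
w_q, w_k, w_v = (torch.randn(d, d) for _ in range(3))
out = qkv_attention(x, w_q, w_k, w_v)
# The score matrix alone holds n^2 = 4096^2 ~ 16.8M entries; doubling the
# image side multiplies that by 16. Plain attention is permutation-
# equivariant, but with absolute positional embeddings the layer is no
# longer shift-equivariant the way a convolution is.
```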
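For item 2, one way the fixed-basis idea can look: a frozen 2-D DCT-II filter bank applied depthwise, with only a learned 1×1 mix on top. The basis choice and layer shapes are my assumptions, not the post's:

```python
import math
import torch
import torch.nn as nn
import torch.nn.functional as F

def dct2_bank(k):
    # k*k separable 2-D DCT-II filters, shape (k*k, 1, k, k)
    pos = torch.arange(k).float()
    basis = torch.cos(math.pi * (pos[None, :] + 0.5) * pos[:, None] / k)  # rows = frequencies
    bank = torch.einsum('fy,gx->fgyx', basis, basis).reshape(k * k, 1, k, k)
    return bank / bank.flatten(1).norm(dim=1)[:, None, None, None]  # unit L2 per filter

class FixedBasisConv(nn.Module):
    def __init__(self, c_in, c_out, k=3):
        super().__init__()
        self.register_buffer('bank', dct2_bank(k))    # frozen: never trained
        self.mix = nn.Conv2d(c_in * k * k, c_out, 1)  # only the 1x1 is learned
        self.k = k

    def forward(self, x):
        b, c, h, w = x.shape
        # depthwise pass: every channel -> k*k fixed basis responses
        y = F.conv2d(x.reshape(b * c, 1, h, w), self.bank, padding=self.k // 2)
        return self.mix(y.reshape(b, c * self.k ** 2, h, w))

layer = FixedBasisConv(16, 32)
print(layer(torch.randn(2, 16, 64, 64)).shape)  # torch.Size([2, 32, 64, 64])
```

Swapping the DCT bank for Gabor or wavelet filters, or relaxing the frozen bank into grouped convolutions, fits the same skeleton.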
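And for item 3, a minimal conditional flow-matching (rectified-flow style) training step for an image→image model; the network, shapes, and optimizer are placeholders chosen for illustration:

```python
import torch
import torch.nn as nn

class TinyVelocityNet(nn.Module):
    # placeholder conditional net: concat state, condition, and a time map
    def __init__(self, c=3):
        super().__init__()
        self.net = nn.Conv2d(2 * c + 1, c, 3, padding=1)

    def forward(self, z_t, x_cond, t):
        t_map = t.expand(-1, 1, *z_t.shape[2:])
        return self.net(torch.cat([z_t, x_cond, t_map], dim=1))

def flow_matching_step(model, x_cond, y, opt):
    # x_cond: source/degraded image, y: target image
    t = torch.rand(y.shape[0], 1, 1, 1)  # random time in [0, 1]
    noise = torch.randn_like(y)
    z_t = (1 - t) * noise + t * y        # straight-line interpolant noise -> y
    v_target = y - noise                 # its constant velocity
    loss = ((model(z_t, x_cond, t) - v_target) ** 2).mean()
    opt.zero_grad(); loss.backward(); opt.step()
    return loss.item()

# Direct x->y training would instead be ((model(x_cond) - y) ** 2).mean():
# one deterministic answer per input, rather than a learned transport from
# noise to the conditional distribution.

model = TinyVelocityNet()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x_cond, y = torch.randn(4, 3, 32, 32), torch.randn(4, 3, 32, 32)
print(flow_matching_step(model, x_cond, y, opt))
```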