Looking for peer indie researchers or grassroots R&D team šŸ¤

Hi everyone, I’m Jen, an indie researcher with a math background.

I’m always curious to find other ā€œgrassrootsā€ R&D teams or individuals who are also in the weeds, building foundational models or systems from first principles.

My current focus is on math reasoning and the ā€œdistributed chaosā€ :turtle: of optimizers like Muon/FSDP.

Just wanted to put a signal out:

  • Are there other independent or small ā€œgrassrootsā€ teams out there I should be following?

  • Who else here is deep-diving into math reasoning problems?

Always looking to connect with fellow trailblazers and see what hard problems everyone is tackling!

3 Likes

Good topic. I’m in that camp. Prototyping a modular opportunistic control system targeting domains where linear reasoning is brittle and brute force is useless (open-ended, unbounded problems).

Cheers.

2 Likes

Thanks for the reply. Your description of a ā€œmodular opportunistic control systemā€ for unbounded problems is fascinating.

It reminds me of the multi-faceted (and somewhat ā€œchaoticā€) approach in Moonshot’s Kimi K2 paper—synthesizing domains, tools, agents, and rubrics all at once.

I’m curious if your work is focused more on the post-training phase (like complex reward models) or on the agentic workflow itself (like the real-time API/tool orchestration)?

For me, my current project is a scratch-build of ReTool paper from ByteDance Seed, so I’m always excited to find other researchers working in this space.

1 Like

Very cool. My understanding is that ReTool takes a kimd of workflow-centric view of orchestration. Something like a declarative DAG of tool calls?

I’m experimenting with a more signal-driven controller that learns when to reallocate attention between subtasks. But it’s a complementary path.

My orchestration layer treats models like K2 as components in a control loop that learns when to change direction, reallocate effort, coordinate subtasks, etc. One key feature is using a variety of custom heuristics to measure and respond to ā€œstucknessā€ on sub tasks. I’m basically treating stuckness as a first class signal instead of a nuisance.

2 Likes

This is a bit off-topic, but I found a real-world example of the Muon optimizer in action? I’ll share it here.

2 Likes

wow, thanks! I will look into it.

1 Like

This is a really interesting approach. ā€˜Stuckness as a first class signal’ is a cool concept. Is there a paper or even a blog post I could read to learn more about this ā€˜signal-driven controller’ idea? I’m always curious to see how different researchers are framing these orchestration problems.

1 Like

In the works, but planning comprehensive documentation after implementation to avoid unnecessary rewrites. I had thought to begin blogging but time has been limited as of late.

If you get interested in collaboration, maybe we can cross pollinate ideas and see what blooms.

Cheers.

2 Likes

Grizz here - I’m from Texas, background in emergency medicine and E.M.S. Operations for over 17 years approaching the whole AI consciousness questions from a radically different angle…

2 Likes