rohan anil
Core Automation
10.7K posts
@_arohan_
member of technical staff & co-founder of @coreautoai - and continuing to aspire to understand deep learning.
Joined December 2017
2,326
Following
43.2K
Followers
  • Pinned
    rohan anil
    Core Automation
    @_arohan_
    Apr 19
    It turns out multi-step backpropaganda is better. The paper has a beautiful way of improving backpropagation. One iteration cleanly gets us backprop, multiple iterations get us a preconditioned update.
    rohan anil
    Core Automation
    @_arohan_
    Apr 19
    Replying to @LinYorker @ryu0000000001 and @weijie444
    arxiv.org/abs/2106.06199 Same update here
    116K
  • rohan anil
    Core Automation
    @_arohan_
    Jun 5, 2025
    A little bit of update from me: I will join the awesome team at @AnthropicAI in two weeks.
    147K
  • rohan anil
    Core Automation
    @_arohan_
    Jun 5, 2025
    92.9K
  • rohan anil
    Core Automation
    @_arohan_
    Jan 13, 2025
    Joining the Llama team @AIatMeta today! Time to train models, finally gpu rich :)
    90.6K
  • rohan anil
    Core Automation
    @_arohan_
    Nov 6, 2025
    Near the office. SF has stepped up its dosa game.
    82K
  • rohan anil
    Core Automation
    @_arohan_
    Dec 7, 2022
    This paper looks like a big step forward for the Transformer architecture! A foundational improvement, not as shiny as other things, but a really big step forward nonetheless
  • rohan anil
    Core Automation
    @_arohan_
    Nov 10, 2025
    Reading this, it's clear that Meta is advancing recommender systems tech faster than other places including G.
    Engineering at Meta
    Meta
    @Meta_Engineers
    Nov 10, 2025
    We’re excited to share details on Meta’s Generative Ads Recommendation Model (GEM), a new foundational model built with LLM-scale techniques that’s already helping create more value for businesses, like +5% increase in ad conversions on Instagram. Dive deep into the technology
    174K
  • rohan anil
    Core Automation
    @_arohan_
    Dec 21, 2024
    Man, Claude solved this verbally by looking at the inputs visually.
    François Chollet
    @fchollet
    Dec 20, 2024
    Replying to @fchollet
    It will also be extremely important to analyze the strengths and limitations of the new system. Here are some examples of tasks that o3 couldn't solve on high-compute settings (even as it was generating millions of CoT search tokens and consuming thousands of dollars of compute
    95.5K
  • rohan anil
    Core Automation
    @_arohan_
    Dec 6, 2024
    A bittersweet moment for me: Gemini is doing really well, and teams are doing great. I had a great run of close to 12 years at G, long enough that one could call me OG. For example, for every search query, I noticed things I was able to contribute to are deeply integrated from the retriever to the
    82.9K
  • rohan anil
    Core Automation
    @_arohan_
    Sep 16, 2023
    Meta researchers just dropped PyTorch Distributed Shampoo 🧴 a few days ago: arxiv.org/pdf/2309.06497… 💥 Train neural networks with a second-order method for better performance. The underlying work it is based on has been a passion project for the last 5 years while swimming
    113K
  • rohan anil
    Core Automation
    @_arohan_
    Oct 11, 2025
    That’s insane to convince a cofounder of thinky to bail this fast.
    Meghan Bobrowsky
    @MeghanBobrowsky
    Oct 11, 2025
    Saturday scoop: Thinking Machines Lab co-founder Andrew Tulloch has joined Meta, the startup confirmed. W/ @keachhagey
    86.9K
  • rohan anil
    Core Automation
    @_arohan_
    Oct 9, 2024
    I got to coauthor papers with two Nobel prize winners, one in Physics and one in Chemistry 😁
    26.7K
  • rohan anil
    Core Automation
    @_arohan_
    Dec 6, 2023
    It’s been a privilege to work alongside our Gemini leads and team (across Google DeepMind, Research and Alphabet) on one of the most interesting and challenging projects of my career. We have three versions of Gemini: (a) Ultra, (b) Pro, and (c) Nano. We make significant
    Jeff Dean
    @JeffDean
    Dec 6, 2023
    I’m very excited to share our work on Gemini today! Gemini is a family of multimodal models that demonstrate really strong capabilities across the image, audio, video, and text domains. Our most-capable model, Gemini Ultra, advances the state of the art in 30 of 32 benchmarks,
    157K
  • rohan anil
    Core Automation
    @_arohan_
    Jun 22, 2022
    A new image generation model just dropped. parti.research.google Great work by the team!
    + Auto-regressive, encoder->decoder Transformer
    + Classifier-free sampling.
    + ViT-VQGAN
    Really amazing results: Image from the website.
