Curious if anybody out there is trying to build a new model/architecture that would succeed the transformer? I geek out on this subject in my spare time. Curious if anybody else is doing so and if you're willing to share ideas? The MAMBA [1] model gained some traction as a potential successor. It's basically an RNN without the nonlinearity applied across hidden states, which makes it logarithmic t
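
For anyone wondering why dropping the nonlinearity matters: a recurrence of the form h_t = a_t * h_{t-1} + b_t is a chain of affine maps, and composing affine maps is associative, so the whole chain can be evaluated with a parallel scan in a logarithmic number of rounds instead of one step per token. Below is a toy scalar sketch of that idea in plain Python. It is not how MAMBA is actually implemented; the names `combine`, `sequential_scan`, and `doubling_scan` are made up for illustration, and the "parallel" rounds are only simulated with list comprehensions.

```python
from typing import List, Tuple

# A pair (a, b) stands for the affine map h -> a * h + b.
Pair = Tuple[float, float]


def combine(first: Pair, second: Pair) -> Pair:
    """Compose two affine maps: apply `first`, then `second`."""
    a1, b1 = first
    a2, b2 = second
    # second(first(h)) = a2 * (a1 * h + b1) + b2
    return (a1 * a2, a2 * b1 + b2)


def sequential_scan(a: List[float], b: List[float], h0: float = 0.0) -> List[float]:
    """Reference loop: h_t = a_t * h_{t-1} + b_t, one step per element."""
    h = h0
    out = []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t
        out.append(h)
    return out


def doubling_scan(
    a: List[float], b: List[float], h0: float = 0.0
) -> Tuple[List[float], int]:
    """Inclusive scan by recursive doubling (Hillis-Steele style).

    Within a round, every position is updated independently of the others,
    so on parallel hardware one round could run as a single step.
    Returns the hidden states and the number of rounds used.
    """
    pairs = list(zip(a, b))
    n = len(pairs)
    rounds = 0
    offset = 1
    while offset < n:
        # After this round, pairs[i] covers the last 2 * offset elements up to i.
        pairs = [
            combine(pairs[i - offset], pairs[i]) if i >= offset else pairs[i]
            for i in range(n)
        ]
        offset *= 2
        rounds += 1
    # pairs[t] is now the composed map from h0 to h_t.
    return [a_t * h0 + b_t for a_t, b_t in pairs], rounds


if __name__ == "__main__":
    a = [0.5, 2.0, -1.0, 0.25, 1.5, 0.0, 3.0, 1.0]
    b = [1.0, -2.0, 0.5, 4.0, 0.0, 1.0, 2.0, -1.0]
    states, rounds = doubling_scan(a, b, h0=2.0)
    print(states == sequential_scan(a, b, h0=2.0), rounds)
```

With a nonlinearity between steps, e.g. h_t = tanh(a_t * h_{t-1} + b_t), each step is no longer an affine map, so `combine` has no equivalent closed form and the recurrence has to be unrolled one step at a time.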