The open-source efficient inference model of AntBrain reduces the inference cost by more than 50%.

The open-source efficient inference model of AntBrain reduces the inference cost by more than 50%.


ant bailing big model team recently announced the release of two new high-efficiency inference models (ring-flash-linear-2.0 and ring-mini-linear-2.0) designed specifically to improve the inference efficiency of deep learning as open-source. in addition, two high-performance fusion operators independently developed (fp8 fusion operator and linear attention inference fusion operator) were also released. these operators support efficient inference with large parameters and low activation numbers, and can handle very long contexts.

thanks to the synergistic effect of architecture optimization and high-performance operators, these new models achieve only one-tenth of the cost compared with high-density models of the same scale in deep learning scenarios. this corresponds to a reduction of more than 50% compared with the previous generation of the ring series. this means that users can greatly reduce the consumption of computing resources when running complex inferences and improve efficiency. furthermore, by closely integrating the operators of the training engine and the inference engine, models can be optimized stably over a long period of time during reinforcement learning, achieving cutting-edge performance in multiple high-difficulty inference rankings.

both models are currently open-sourced on platforms such as hugging face and modelscope, allowing developers to access and experiment with the models. this open-sourcing not only demonstrates ant financial's technological capabilities in the ai field, but also is expected to promote further breakthroughs in ai research and application by providing developers with efficient tools.

The open-source efficient inference model of AntBrain reduces the inference cost by more than 50%.

ant bailing big model team recently announced the release of two new high-efficiency inference models (ring-flash-linear-2.0 and ring-mini-linear-2.0) designed

The open-source efficient inference model of AntBrain reduces the inference cost by more than 50%.

The FF 91 2.0 from Faraday Future has been delivered to a real estate magnate in Southern California, marking the birth of a new B2B2C model.

faraday future (ff) recently announced that calvin gong, president of pinnacle real estate group, will officially deliver the next-generation ff 91 2.0 futuris

The FF 91 2.0 from Faraday Future has been delivered to a real estate magnate in Southern California, marking the birth of a new B2B2C model.

The Korean food delivery platform has integrated Alipay and WeChat for the first time and welcomed the new visa-free policy for Chinese tourists.

it was reported that south korea's largest food delivery platform, "baedal minjok" (badal minjok), officially integrated alipay and wechat pay on september 25th

The Korean food delivery platform has integrated Alipay and WeChat for the first time and welcomed the new visa-free policy for Chinese tourists.

Mr. Altman predicts that AGI will become available by 2030: AI will reshape the model of future work.

sam altman, the ceo of openai, recently made an important prediction that by 2030, a general artificial intelligence (agi) capable of surpassing human intellig

Mr. Altman predicts that AGI will become available by 2030: AI will reshape the model of future work.

The battery life showdown between the Apple iPhone Air and the Samsung Galaxy S25 Edge: the winner will be determined in just one minute.

despite having a smaller battery capacity, the apple iphone air was expected to take the lead in battery life thanks to its c1x 5g modem and proprietary n1 wir

The battery life showdown between the Apple iPhone Air and the Samsung Galaxy S25 Edge: the winner will be determined in just one minute.

The Google Gemini series models have been upgraded again, with significant improvements in speed and efficiency.

google recently released a major update to its large-scale language model, the gemini series. particularly noteworthy is the release of gemini 2.5 flash and fl

The Google Gemini series models have been upgraded again, with significant improvements in speed and efficiency.

The first electric car from Lamborghini isn't actually a car at all.

an italian brand known for its supercars has unveiled its first fully electric vehicle. however, what was announced this time was not a car, but an electric wa

The first electric car from Lamborghini isn't actually a car at all.

The new porous material could potentially extend battery life by several times.

scientists from the helmholtz center in berlin and the technical university of berlin have made significant progress in the development of next-generation lith

The new porous material could potentially extend battery life by several times.

A durable and inexpensive building material made from cardboard and mud.

australian scientists have demonstrated a new building material made from cardboard, soil, and water. it is suitable for low-rise buildings and has the potenti

A durable and inexpensive building material made from cardboard and mud.

Ducati's electric motorcycle uses a solid battery and can reach a speed of 273 kilometers per hour.

ducati has unveiled a prototype of its electric sports bike, the v21l. one of its main features is the adoption of a solid-state battery. this is the first time

Ducati's electric motorcycle uses a solid battery and can reach a speed of 273 kilometers per hour.

Two-armed micro-robots are set to revolutionize the way electronic devices are manufactured.

the san francisco-based startup microfactory has unveiled a robot that is expected to revolutionize small-scale manufacturing and automation as a whole. this p

Two-armed micro-robots are set to revolutionize the way electronic devices are manufactured.

An underwater solar panel boasting record-breaking efficiency has been developed.

south korean scientists have demonstrated that solar panels can operate efficiently not only on land but also underwater. this advancement opens up new possibi

An underwater solar panel boasting record-breaking efficiency has been developed.

AinRide announces an unmanned, electric truck for autonomous driving.

the world's first fully autonomous large-scale electric truck has started test runs at the port of antwerp-bruges, one of europe's largest logistics hubs. this

AinRide announces an unmanned, electric truck for autonomous driving.

Adopting an unusual wheel concept instead of an engine and a transmission.

david henson, an inventor from denver, has come up with an unusual idea. it involves wheels that function as engines, with the potential to completely replace

Adopting an unusual wheel concept instead of an engine and a transmission.

The researchers are teaching people how to capture the "hidden colors" with their smartphone cameras.

it's possible to turn ordinary photos taken with smartphones into scientific tools. scientists have discovered a way to extract hidden spectral information fro

The researchers are teaching people how to capture the "hidden colors" with their smartphone cameras.