Evaluation set size: 820 samples (the number of problems in the eval set)
Photo: Parij Borgohain / Unsplash
You don’t need “INSANE BENCHMARKS” to win at this. You just need solid technical work and solid technical writing explaining the trade-offs and limitations of your offering. You can see another example with Turbopuffer. Their benchmarks are not impressive, particularly when compared to their competitors. Their documentation has more lines discussing the things the database cannot do than the things it can do. But everyone knows that if your use case fits their offering, they have the best product for search on the market, miles ahead of the competition. They don’t drink their competitors’ tears; they just quietly take their customers.
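The most draining part is that once the article is written, the creative state is over, yet you still have to grind through an hour of mechanical work. But now I finish an article in Notion, drag its status to "Ready to publish", and go make a cup of coffee. By the time I get back, the article is live in three languages, the published links have been written back to the database, and the status has automatically changed to "Published".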
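The status-driven workflow described here can be sketched roughly as follows. This is a minimal in-memory illustration, not the author's actual setup: the `Article` record, the `publish_ready` function, the language codes, and the `translate` and `publish` callables are all hypothetical stand-ins for whatever reads the Notion database, translates the text, and pushes it to the publishing platform.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Status labels assumed from the workflow in the text.
READY = "Ready to publish"
PUBLISHED = "Published"

# Assumption: the text only says "three languages"; these codes are illustrative.
LANGUAGES = ("zh", "en", "ja")


@dataclass
class Article:
    """Hypothetical stand-in for one row of the article database."""
    title: str
    body: str
    status: str = "Draft"
    links: Dict[str, str] = field(default_factory=dict)


def publish_ready(
    articles: List[Article],
    translate: Callable[[str, str], str],     # (body, lang) -> translated body
    publish: Callable[[str, str, str], str],  # (title, body, lang) -> public URL
) -> List[Article]:
    """Publish every article marked READY, then write links and status back."""
    done = []
    for article in articles:
        if article.status != READY:
            continue  # drafts and already-published rows are left untouched
        links = {}
        for lang in LANGUAGES:
            links[lang] = publish(article.title, translate(article.body, lang), lang)
        # Update the record only after every language has been published,
        # so a failure partway through leaves the row in the READY state.
        article.links = links
        article.status = PUBLISHED
        done.append(article)
    return done
```

In a real deployment a function like this would presumably run on a schedule or a webhook, with `translate` and `publish` wrapping the actual services. Flipping the status last keeps a retry safe to trigger, provided `publish` tolerates being called twice for the same article.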