BUBECK S, CHANDRASEKARAN V, ELDAN R, et al. Sparks of Artificial General Intelligence: Early experiments with GPT-4[J/OL]. [2024-03-02]. https://arxiv.org/pdf/2303.12712
[3]
FENG Zhiwei. Principles for translating scientific and technical terms[N]. Guangming Daily, 2023-05-21, Language and Script column.
[4]
VASWANI A, SHAZEER N, PARMAR N, et al. Attention is all you need[M]//Guyon I, Luxburg U V, Bengio S, et al. Advances in Neural Information Processing Systems: volume 30. Curran Associates, Inc., 2017. https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf
[5]
KAPLAN J, MCCANDLISH S, HENIGHAN T, et al. Scaling laws for neural language models[J]. arXiv preprint arXiv:2001.08361, 2020.