Author of the publication

A Study on Knowledge Distillation from Weak Teacher for Scaling Up Pre-trained Language Models.

ACL (Findings), pages 11239-11246. Association for Computational Linguistics (2023)


Other publications of authors with the same name

Unstructured big data analysis algorithm and simulation of Internet of Things based on machine learning. Neural Comput. Appl., 32 (10): 5399-5407 (2020)

Using random forest algorithm to predict super-secondary structure in proteins. J. Supercomput., 76 (5): 3199-3210 (2020)

Dynamical complexity of pricing and green level for a dyadic supply chain with capital constraint. Math. Comput. Simul. (2022)

An information modeling framework for bridge monitoring. Adv. Eng. Softw. (2017)

Two-period pricing strategy in a supply chain with intertemporal and horizontal reference price effects. INFOR Inf. Syst. Oper. Res., 59 (4): 639-667 (2021)

Complex dynamic analysis of risk-averse newsvendor models with buyback guarantee financing. Int. J. Prod. Res., 60 (9): 2865-2883 (2022)

An Efficient Method of Crowd Aggregation Computation in Public Areas. IEEE Trans. Circuits Syst. Video Technol., 28 (10): 2814-2825 (2018)

A Novel Probabilistic Saturating Counter Design for Secure Branch Predictor. J. Comput. Sci. Technol., 36 (5): 1022-1036 (2021)

Int-Monitor: a model triggered hardware trojan in deep learning accelerators. J. Supercomput., 79 (3): 3095-3111 (2023)

Constructing Edge-Colored Graph for Heterogeneous Networks. J. Comput. Sci. Technol., 30 (5): 1154-1160 (2015)