Author of the publication

InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding.

, , , , , , , , , , and . CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Shape-Sensitive Feature Extraction for Large-Aspect-Ratio Object Detection., , , , , and . IEEE Geosci. Remote. Sens. Lett., (2024)Share Your Data Carefree: An Efficient, Scalable and Privacy-Preserving Data Sharing Service in Cloud Computing., , , , , and . IEEE Trans. Cloud Comput., 11 (1): 822-838 (January 2023)Attacking and Protecting Data Privacy in Edge-Cloud Collaborative Inference Systems., , and . IEEE Internet Things J., 8 (12): 9706-9716 (2021)Weighted Pseudo-θ-Almost Periodic Sequence and Finite-Time Guaranteed Cost Control for Discrete-Space and Discrete-Time Stochastic Genetic Regulatory Networks with Time Delays., , and . Axioms, 12 (7): 682 (July 2023)Exponential stability and synchronisation of fuzzy Mittag-Leffler discrete-time Cohen-Grossberg neural networks with time delays., , and . Int. J. Syst. Sci., 53 (11): 2318-2340 (2022)Dynamic behaviours for semi-discrete stochastic Cohen-Grossberg neural networks with time delays., , and . J. Frankl. Inst., 357 (17): 13006-13040 (2020)Adaptive Region Boosting method with biased entropy for path planning in changing environment., , , and . CAAI Trans. Intell. Technol., 1 (2): 179-188 (2016)Performance releaser with smart anchor learning for arbitrary-oriented object detection., , , , , , and . CAAI Trans. Intell. Technol., 8 (4): 1213-1225 (December 2023)Design, Implementation and Verification of Cloud Architecture for Monitoring a Virtual Machine's Security Health., and . IEEE Trans. Computers, 67 (6): 799-815 (2018)InternEvo: Efficient Long-sequence Large Language Model Training via Hybrid Parallelism and Redundant Sharding., , , , , , , , , and 1 other author(s). CoRR, (2024)