Zhiling Lan
Title
Cited by
Year
Toward automated anomaly identification in large-scale systems
Z Lan, Z Zheng, Y Li
IEEE Transactions on Parallel and Distributed Systems 21 (2), 174-187, 2009
Cited by: 117; Year: 2009
System log pre-processing to improve failure prediction
Z Zheng, Z Lan, BH Park, A Geist
2009 IEEE/IFIP International Conference on Dependable Systems & Networks …, 2009
Cited by: 116; Year: 2009
Exploit failure prediction for adaptive fault-tolerance in cluster computing
Y Li, Z Lan
Sixth IEEE International Symposium on Cluster Computing and the Grid (CCGRID …, 2006
Cited by: 87; Year: 2006
A survey of load balancing in grid computing
Y Li, Z Lan
International Conference on Computational and Information Science, 280-285, 2004
Cited by: 86; Year: 2004
Dynamic load balancing of SAMR applications on distributed systems
Z Lan, VE Taylor, G Bryan
SC'01: Proceedings of the 2001 ACM/IEEE Conference on Supercomputing, 24-24, 2001
Cited by: 85; Year: 2001
Co-analysis of RAS log and job log on Blue Gene/P
Z Zheng, L Yu, W Tang, Z Lan, R Gupta, N Desai, S Coghlan, D Buettner
2011 IEEE International Parallel & Distributed Processing Symposium, 840-851, 2011
Cited by: 79; Year: 2011
Lightweight silent data corruption detection based on runtime data analysis for HPC applications
E Berrocal, L Bautista-Gomez, S Di, Z Lan, F Cappello
Proceedings of the 24th International Symposium on High-Performance Parallel …, 2015
Cited by: 75; Year: 2015
Integrating dynamic pricing of electricity into energy aware scheduling for HPC systems
X Yang, Z Zhou, S Wallace, Z Lan, W Tang, S Coghlan, ME Papka
SC'13: Proceedings of the International Conference on High Performance …, 2013
Cited by: 73; Year: 2013
Practical online failure prediction for Blue Gene/P: Period-based vs event-driven
L Yu, Z Zheng, Z Lan, S Coghlan
2011 IEEE/IFIP 41st International Conference on Dependable Systems and …, 2011
Cited by: 71; Year: 2011
Analyzing and adjusting user runtime estimates to improve job scheduling on the Blue Gene/P
W Tang, N Desai, D Buettner, Z Lan
2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010
Cited by: 69; Year: 2010
Reliability-aware scalability models for high performance computing
Z Zheng, Z Lan
2009 IEEE International Conference on Cluster Computing and Workshops, 1-9, 2009
Cited by: 68; Year: 2009
Fault-aware, utility-based job scheduling on Blue Gene/P systems
W Tang, Z Lan, N Desai, D Buettner
2009 IEEE International Conference on Cluster Computing and Workshops, 1-10, 2009
Cited by: 66; Year: 2009
Dynamic load balancing for structured adaptive mesh refinement applications
Z Lan, VE Taylor, G Bryan
International Conference on Parallel Processing, 571-579, 2001
Cited by: 65; Year: 2001
A meta-learning failure predictor for Blue Gene/L systems
P Gujrati, Y Li, Z Lan, R Thakur, J White
2007 International Conference on Parallel Processing (ICPP 2007), 40-40, 2007
Cited by: 62; Year: 2007
Reducing energy costs for IBM Blue Gene/P via power-aware job scheduling
Z Zhou, Z Lan, W Tang, N Desai
Workshop on Job Scheduling Strategies for Parallel Processing, 96-115, 2013
Cited by: 60; Year: 2013
A practical failure prediction with location and lead time for Blue Gene/P
Z Zheng, Z Lan, R Gupta, S Coghlan, P Beckman
2010 International Conference on Dependable Systems and Networks Workshops …, 2010
Cited by: 60; Year: 2010
Dynamic meta-learning for failure prediction in large-scale systems: A case study
J Gu, Z Zheng, Z Lan, J White, E Hocks, BH Park
2008 37th International Conference on Parallel Processing, 157-164, 2008
Cited by: 60; Year: 2008
A novel dynamic load balancing scheme for parallel systems
Z Lan, VE Taylor, G Bryan
Journal of Parallel and Distributed Computing 62 (12), 1763-1781, 2002
Cited by: 58; Year: 2002
Watch out for the bully! job interference study on dragonfly network
X Yang, J Jenkins, M Mubarak, RB Ross, Z Lan
SC'16: Proceedings of the International Conference for High Performance …, 2016
Cited by: 57; Year: 2016
A study of dynamic meta-learning for failure prediction in large-scale systems
Z Lan, J Gu, Z Zheng, R Thakur, S Coghlan
Journal of Parallel and Distributed Computing 70 (6), 630-643, 2010
Cited by: 56; Year: 2010