Shihan Dou

Cited by

	All	Since 2019
Citations	629	629
h-index	10	10
i10-index	10	10

Citations per year: 10 (2021), 40 (2022), 221 (2023), 356 (2024)

Public access (based on funding mandates)

5 articles available
0 articles not available

Co-authors

Huang Xuanjing (黄萱菁), Professor of Computer Science, Fudan University. Verified email at fudan.edu.cn
Qi Zhang (张奇), Professor of Computer Science, Fudan University. Verified email at fudan.edu.cn
Yueming Wu, Nanyang Technological University. Verified email at ntu.edu.sg
Tao Gui (桂韬), Fudan University. Verified email at fudan.edu.cn
Hai Jin, Huazhong University of Science and Technology. Verified email at hust.edu.cn
Rui Zheng, Fudan University. Verified email at fudan.edu.cn
Xipeng Qiu (邱锡鹏), Professor of Computer Science, Fudan University. Verified email at fudan.edu.cn

Shihan Dou

Fudan University

Verified email at m.fudan.edu.cn

Alignment, RLHF, Natural Language Processing, Security


Title	Cited by	Year
The rise and potential of large language model based agents: A survey. Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ... arXiv preprint arXiv:2309.07864, 2023	280	2023
VulCNN: An image-inspired scalable vulnerability detection system. Y Wu, D Zou, S Dou, W Yang, D Xu, H Jin. Proceedings of the 44th International Conference on Software Engineering …, 2022	71	2022
SCDetector: Software functional clone detection based on semantic tokens analysis. Y Wu, D Zou, S Dou, S Yang, W Yang, F Cheng, H Liang, H Jin. Proceedings of the 35th IEEE/ACM international conference on automated …, 2020	55	2020
Secrets of RLHF in large language models part I: PPO. R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ... arXiv preprint arXiv:2307.04964, 2023	51*	2023
IntDroid: Android malware detection based on API intimacy analysis. D Zou, Y Wu, S Yang, A Chauhan, W Yang, J Zhong, S Dou, H Jin. ACM Transactions on Software Engineering and Methodology (TOSEM) 30 (3), 1-32, 2021	36	2021
MINER: Improving out-of-vocabulary named entity recognition from an information theoretic perspective. X Wang, S Dou, L Xiong, Y Zou, Q Zhang, T Gui, L Qiao, Z Cheng, ... arXiv preprint arXiv:2204.04391, 2022	25	2022
Obfuscation-resilient Android malware analysis based on contrastive learning. Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin. arXiv preprint arXiv:2107.03799, 2021	17	2021
Secrets of RLHF in Large Language Models Part II: Reward Modeling. B Wang, R Zheng, L Chen, Y Liu, S Dou, C Huang, W Shen, S Jin, E Zhou, ... arXiv preprint arXiv:2401.06080, 2024	14*	2024
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment. S Dou, E Zhou, Y Liu, S Gao, J Zhao, W Shen, Y Zhou, Z Xi, X Wang, ... arXiv preprint arXiv:2312.09979, 2023	13*	2023
Towards understanding the capability of large language models on code clone detection: a survey. S Dou, J Shan, H Jia, W Deng, Z Xi, W He, Y Wu, T Gui, Y Liu, X Huang. arXiv preprint arXiv:2308.01191, 2023	11*	2023
Loose lips sink ships: Mitigating length bias in reinforcement learning from human feedback. W Shen, R Zheng, W Zhan, J Zhao, S Dou, T Gui, Q Zhang, X Huang. arXiv preprint arXiv:2310.05199, 2023	9	2023
Contrastive learning for robust Android malware familial classification. Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin. IEEE Transactions on Dependable and Secure Computing, 2022	9	2022
Kernel-whitening: Overcome dataset bias with isotropic sentence embedding. S Gao, S Dou, Q Zhang, X Huang. arXiv preprint arXiv:2210.07547, 2022	6	2022
Decorrelate irrelevant, purify relevant: Overcome textual spurious correlations from a feature perspective. S Dou, R Zheng, T Wu, S Gao, J Shan, Q Zhang, Y Wu, X Huang. arXiv preprint arXiv:2202.08048, 2022	6	2022
Tailoring Personality Traits in Large Language Models via Unsupervisedly-Built Personalized Lexicons. T Li, S Dou, C Lv, W Liu, J Xu, M Wu, Z Ling, Z Xiaoqing, X Huang. arXiv preprint arXiv:2310.16582, 2024	4	2024
ToolEyes: Fine-grained evaluation for tool learning capabilities of large language models in real-world scenarios. J Ye, G Li, S Gao, C Huang, Y Wu, S Li, X Fan, S Dou, Q Zhang, T Gui, ... arXiv preprint arXiv:2401.00741, 2024	3	2024
Open the Pandora's Box of LLMs: Jailbreaking LLMs through Representation Engineering. T Li, S Dou, W Liu, M Wu, C Lv, X Zheng, X Huang. arXiv preprint arXiv:2401.06824, 2024	2	2024
Delve into PPO: Implementation matters for stable RLHF. R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Y Zhou, ... NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following, 2023	2	2023
Gitor: Scalable Code Clone Detection by Building Global Sample Graph. J Shan, S Dou, Y Wu, H Wu, Y Liu. arXiv preprint arXiv:2311.08778, 2023	2	2023
Improving generalization of alignment with human preferences through group invariant learning. R Zheng, W Shen, Y Hua, W Lai, S Dou, Y Zhou, Z Xi, X Wang, H Huang, ... arXiv preprint arXiv:2310.11971, 2023	2	2023
