Jing ZHANG

Associate Professor
Department of Computer Science and Technology
School of Information
Renmin University of China

Email: zhang-jing AT ruc DOT edu DOT cn

Address: Room 427, Information Building, Renmin University of China, Beijing


Publications | Talks | Services


I am an associate professor at School of Information, Renmin University of China. Prior to that, I received my Ph.D. degree from Department of Computer Science and Technology, Tsinghua University under supervision of Professor Jie Tang and Professor Juanzi Li. My research direction is data mining and knowledge discovery. My current research revolves around tailoring large language models to suit specific domains, investigating model compression methods, and improving inference efficiency. I prioritize the application of these approaches to structured data and the optimization of databases. More information about my experience and research can be found on Google Scholar.

I am looking for highly-motivated students to work with me. If interested, please drop me a message by email.


Publications

  • all : conference | journal | pre-print
  • large language model : fine-tuning | knowledge augmentation | LLM Compression | LLM for Structured Data | evaluation
  • knowledge reasoning : entity linking | entity alignment | knowledge graph completion | question answering | dialogue | text2sql
  • graph representation learning : gnn | pre-training and SSL for gnn | probabilistic graphic model | node similarity
  • social computing : recommendation | social influence | link prediction
    1. TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios .
      Xiaokang Zhang, Jing Zhang*, Zeyao Ma, Yang Li, Bohan Zhang, Guanlin Li, Zijun Yao, Kangli Xu, Jinchang Zhou, Daniel Zhang-Li, Jifan Yu, Shu Zhao, Juanzi Li, Jie Tang
      arXiv:2403.19318.
      code&model&data&demo
    2. Compressing Large Language Models by Streamlining the Unimportant Layer.
      Xiaodong Chan, Yuxuan Hu, Jing Zhang*
      arXiv:2403.19135.
    3. Large Language Model for Table Processing: A Survey.
      Weizheng Lu, Jiaming Zhang, Jing Zhang*, Yueguo Chen
      arXiv:2402.05121.
    4. CodeS: Towards Building Open-source Language Models for Text-to-SQL.
      Haoyang Li, Jing Zhang*, Hanbing Liu, Ju Fan, Xiaokang Zhang, Jun Zhu, Renjie Wei, Hongyan Pan, Cuiping Li, Hong Chen.
      SIGMOD'24.
      code&model&data
    5. Open-World Semi-Supervised Learning for Node Classification.
      Yanling Wang, Jing Zhang*, Lingxi Zhang, Lixin Liu, Yuxiao Dong, Cuiping Li, Hong Chen, Hongzhi Yin.
      ICDE'24.
    6. A Cause-Effect Look at Alleviating Hallucination of Knowledge-grounded Dialogue Generation.
      Jifan Yu, Xiaohan Zhang, Yifan Xu, Xuanyu Lei, Zijun Yao, Jing Zhang, Lei Hou, Juanzi Li
      LREC-COLING'24.
    7. Diversifying Question Generation over Knowledge Base via External Natural Questions.
      Shasha Guo, Jing Zhang*, Xirui Ke, Cuiping Li, Hong Chen
      LREC-COLING'24.
    8. SGSH: Stimulate Large Language Models with Skeleton Heuristics for Knowledge Base Question Generation.
      Shasha Guo, Lizi Liao, Jing Zhang*, Yanling Wang, Cuiping Li, Hong Chen
      Findings of NAACL'24.
    9. Transformer Compression via Subspace Projection.
      Yuxuan Hu, Jing Zhang*, Chen Zhao, Cuiping Li, Hong Chen.
      arXiv:2308.16475.
    10. A Generation-based Deductive Method for Math Word Problems.
      Yuxuan Hu, Jing Zhang*, Haoyang Li, Cuiping Li, Hong Chen.
      EMNLP'23.
    11. FFAEval: Evaluating Dialogue System via Free-For-All Ranking.
      Zeyao Ma, Zijun Yao, Jing Zhang*, Jifan Yu, Xiaohan Zhang, Juanzi Li, Jie Tang.
      Findings of EMNLP'23.
    12. GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation.
      Jing Zhang, Xiaokang Zhang, Daniel Zhang-Li, Jifan Yu, Zijun Yao, Zeyao Ma, Yiqi Xu, Haohua Wang, Xiaohan Zhang, Nianyi Lin, Sunrui Lu, Juanzi Li, Jie Tang.
      KDD'23 (ADS).
    13. Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and Toolkit.
      Bo Chen, Jing Zhang*, Fanjin Zhang, Tianyi Han, Yuqing Cheng, Xiaoyan Li, Yuxiao Dong, Jie Tang.
      KDD'23 (ADS).
    14. FC-KBQA: A Fine-to-Coarse Composition Framework for Knowledge Base Question Answering.
      Lingxi Zhang, Jing Zhang*, Yanling Wang, Shulin Cao, Xinmei Huang, Cuiping Li, Hong Chen, Juanzi Li.
      ACL'23.
    15. Chain of Thought Prompting Elicits Knowledge Augmentation.
      Dingjun Wu, Jing Zhang*, Xinmei Huang.
      Findings of ACL'23.
    16. RESDSQL: Decoupling Schema Linking and Skeleton Parsing for Text-to-SQL.
      Haoyang Li, Jing Zhang*, Cuiping Li, Hong Chen.
      AAAI'23.
    17. A survey on complex factual question answering.
      Lingxi Zhang, Jing Zhang*, Xirui Ke, Haoyang Li, Xinmei Huang, Zhonghui Shao, Shulin Cao, Xin Lv.
      AI Open'22.
    18. Graph Contrastive Learning for Anomaly Detection.
      Bo Chen, Jing Zhang*, Xiaokang Zhang, Yuxiao Dong, Jian Song, Peng Zhang, Kaibo Xu, Evgeny Kharlamov, Jie Tang.
      TKDE'22.
    19. DSM: Question Generation over Knowledge Base via Modeling Diverse Subgraphs with Meta-learner.
      Shasha Guo, Jing Zhang*, Yanling Wang, Qianyi Zhang, Cuiping Li and Hong Chen.
      EMNLP'22.
    20. Knowledge-augmented Self-training of A Question Rewriter for Conversational Knowledge Base Question Answering.
      Xirui Ke, Jing Zhang*, Xin Lv, Yiqi Xu, Shulin Cao, Cuiping Li, Hong Chen and Juanzi Li.
      Findings of EMNLP'22 (Findings of the Association for Computational Linguistics: EMNLP 2022).
    21. XDAI: A Tuning-free Framework for Exploiting Pre-trained Language Models in Knowledge Grounded Dialogue Generation.
      Jifan Yu, Xiaohan Zhang, Yifan Xu, Xuanyu Lei, Xinyu Guan, Jing Zhang, Lei Hou, Juanzi Li, and Jie Tang.
      KDD'22.
    22. Subgraph Retrieval Enhanced Model for Multi-hop Knowledge Base Question Answering.
      Jing Zhang, Xiaokang Zhang, Jifan Yu, Jian Tang, Jie Tang, Cuiping Li, Hong Chen.
      ACL'22.
    23. HOSMEL: A Hot-Swappable Modulized Entity Linking Toolkit for Chinese.
      Daniel Zhang-Li, Jing Zhang*, Jifan Yu, Xiaokang Zhang, Peng Zhang, Jie Tang, Juanzi Li.
      ACL'22 (Demo).
    24. ClusterSCL: Cluster-Aware Supervised Contrastive Learning on Graphs.
      Yanling Wang, Jing Zhang*, Haoyang Li, Yuxiao Dong, Hongzhi Yin, Cuiping Li, Hong Chen.
      WWW'22.
    25. CODE: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking.
      Bo Chen, Jing Zhang*, Xiaokang Zhang, Xiaobin Tang, Lingfan Cai, Hong Chen, Cuiping Li, Peng Zhang, Jie Tang.
      AAAI'22.
    26. A Pretraining Numerical Reasoning Model for Ordinal Constrained Question Answering on Knowledge Base.
      Yu Feng, Jing Zhang*, Gaole He, Wayne Xin Zhao, Lemao Liu, Quan Liu, Cuiping Li, Hong Chen.
      Findings of EMNLP'21 (Findings of the Association for Computational Linguistics: EMNLP 2021).
    27. P-INT: A Path-based Interaction Model for Few-shot Knowledge Graph Completion.
      Jingwen Xu, Jing Zhang*, Xirui Ke, Yuxiao Dong, Hong Chen, Cuiping Li, Yongbin Liu.
      Findings of EMNLP'21 (Findings of the Association for Computational Linguistics: EMNLP 2021).
    28. Neural, symbolic and neural-symbolic reasoning on knowledge graphs.
      Jing Zhang, Bo Chen, Lingxi Zhang, Xirui Ke and Haipeng Ding.
      AI Open'21.
    29. Decoupling Representation Learning and Classification for GNN-based Anomaly Detection.
      Yanling Wang, Jing Zhang*, Hongzhi Yin, Cuiping Li, and Hong Chen.
      SIGIR'21 (Proceedings of the 44rd ACM International SIGIR Conference on Research and Development in Information Retrieval).
    30. OAG_know: Self-supervised Learning for Linking Knowledge Graphs.
      Xiao Liu, Li Mian, Yuxiao Dong, Fanjin Zhang, Jing Zhang, Jie Tang, Peng Zhang, Jibing Gong, and Kuansan Wang.
      TKDE'21 (IEEE Transaction on Knowledge and Data Engineering).
    31. GCC: Graph Contrastive Coding for Graph Neural Network Pre-Training.
      Jiezhong Qiu, Qibin Chen, Yuxiao Dong, Jing Zhang, Hongxia Yang, Ming Ding, Kuansan Wang, and Jie Tang.
      KDD'20 (Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).
    32. BERT-INT: A BERT-based Interaction Model For Knowledge Graph Alignment.
      Xiaobin Tang, Jing Zhang*, Bo Chen, Yang Yang, Hong Chen, Cuiping Li.
      IJCAI'20 (Proceedings of the 29th International Joint Conference on Artificial Intelligence).
    33. CONNA: Addressing Name Disambiguation on The Fly.
      Bo Chen, Jing Zhang*, Jie Tang, Lingfan Cai, Zhaoyu Wang, Shu Zhao, Hong Chen, and Cuiping Li.
      TKDE'20 (IEEE Transaction on Knowledge and Data Engineering).
    34. Robust Network Enhancement from Flawed Networks.
      Jiarong Xu, Yang Yang, Chunping Wang, Zongtao Liu, Jing Zhang, Lei Chen and Jiangang Lu.
      TKDE'20 (IEEE Transaction on Knowledge and Data Engineering).
    35. JarKA: Modeling Attribute Interactions for Cross-lingual Knowledge Alignment.
      Bo Chen, Jing Zhang*, Xiaobin Tang, Hong Chen, Cuiping Li.
      PAKDD'20 (Proceedings of the 24th Pacific-Asia Conference on Knowledge Discovery and Data Mining).
    36. Graph Convolutional Network using a Reliability-based Feature Aggregation Mechanism.
      Yanling Wang, Cuiping Li, Jing Zhang, Peng Ni, Hong Chen.
      DASFAA'20 (Proceedings of the 5th International Conference on Database Systems for Advanced Applications).
    37. Trust Relationship Prediction in Alibaba E-Commerce Platform.
      Yukuo Cen, Jing Zhang*, Gaofei Wang, Yujie Qian, Chuizheng Meng, Zonghong Dai, Hongxia Yang, Jie Tang.
      TKDE'19 (IEEE Transaction on Knowledge and Data Engineering).
    38. MEgo2Vec: Embedding Matched Ego Networks for User Alignment Across Social Networks.
      Jing Zhang, Bo Chen, Xianming Wang, Hong Chen, Cuiping Li, Fengmei Jin, Guojie Song and Yutao Zhang.
      CIKM'18 (Proceedings of the International Conference on Information and Knowledge Management).
    39. Fast and Flexible Top-k Similarity Search on Large Networks.
      Jing Zhang, Jie Tang, Cong Ma, Hanghang Tong, Yu Jing, Juanzi Li, Walter Luyten, and Marie-Francine Moens.
      TOIS'17 (ACM Transactions on Information Systems).
    40. Panther: Fast Top-k Similarity Search on Large Networks.
      Jing Zhang, Jie Tang, Cong Ma, Hanghang Tong, Yu Jing, and Juanzi Li.
      SIGKDD'15 (Proceedings of the 21rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining).
      code&data Slides_PPT Slides_PDF Poster
    41. A Unified Probabilistic Framework for Name Disambiguation in Digital Library.
      Jie Tang, A.C.M. Fong, Bo Wang, and Jing Zhang.
      TKDE'12.

    Students

    I'm very lucky to have the opportunities to work with these brilliant students (ordered by year of enrollment).

    Graduate Students

    1. Zhiming Yao, Ph.D. student, Fall 2024 -
    2. Yang Li, Ph.D. student, Fall 2024 -
    3. Bohan Zhang, M.S. student, Fall 2024 -
    4. Xiaodong Chen, M.S. student, Fall 2024 -
    5. Guanlin Li, M.S. student, Fall 2024 -
    6. Yuanchun Wang, Ph.D. student, Fall 2023 -
    7. Xinxin Zhao, Ph.D. student, Fall 2023 - (Co-supervised with Prof. Cuiping Li)
    8. Zeyao Ma, M.S. student, Fall 2023 -
    9. Xinmei Huang, M.S. student, Fall 2023 -
    10. Yuxuan Hu, Ph.D. student, Fall 2022 -
    11. Xiaokang Zhang, M.S. student, Fall 2022 -
    12. Zhonghui Shao, M.S. student, Fall 2022 -
    13. Haoyang Li, Ph.D. student, Fall 2021 - (Co-supervised with Prof. Cuiping Li)
    14. Xirui Ke, M.S. student, Fall 2021 -
    15. Lingxi Zhang, M.S. student, Fall 2021 -
    16. Shasha Guo, Ph.D. student, Fall 2020 - (Co-supervised with Prof. Cuiping Li)
    Alumni
    1. Yanling Wang, Ph.D., 2018-2023. Dissertation Title: Research on Graph Contrastive Learning for Complex Scenes (Co-supervised with Prof. Cuiping Li), First Employment: Zhongguancun Laboratory
    2. Jingwen Xu, M.S., 2020-2023. Thesis Title: Few-shot Knowledge Graph Completion Based on Path Interaction Supervised Learning and Large Language Model Prompt Learning, First Employment: China Construction Bank
    3. Chenxu Hu, M.S., 2020-2022. Thesis Title: Metadata Management under Multi-model Big Data Systems (Co-supervised with Prof. Feng Zhang), First Employment: People's Bank of China
    4. Xiaobin Tang, M.S., 2019-2022. Thesis Title: Interaction and Self-training based Methods for Entity Alignment, First Employment: China Tobacco
    5. Xiangying Cao, M.S., 2019-2021. Thesis Title: Entity Linking-based Disambiguation of Academic Institution Names, First Employment: People's Bank of China
    6. Lingfan Cai, M.S., 2018-2021. Thesis Title: The optimization of zero-shot entity linking, First Employment: CITIC Securities
    7. Bowen Hao, Ph.D., 2017-2022. Dissertation Title: A Research on User Profile Denoising and Cold-start Issue in Recommender System (Co-supervised with Prof. Cuiping Li). First Employment: Assistant Professor, Capital Normal University
    8. Bo Chen, M.S., 2017-2020. Thesis Title: Knowledge Graph Integration and Entity Linking from Multi-Sources (Co-supervised with Prof. Hong Chen). First Employment: Pursuing a Ph.D. in Computer Science and Technology at Tsinghua University

    Invited Talks

    1. 2024: Invited Talk about LLM4DB at Huawei.
    2. 2023: Invited Talk about StructruedDataLLM at the 11th China National Conference on Social Media Processing.
    3. 2023: Invited Talk about Integrating Structured Data With LLM at School of Computer Science & Techonlogy, Anhui University.
    4. 2023: Invited Talk about Integrating Structured Data With LLM by CCF CCF Database Special Committee.
    5. 2023: Invited Talk about ChatGPT at PingCAP.
    6. 2022: Invited Talk about Graph Contrastive Learning at BAAI Seminar 2022. slides
    7. 2022: Invited Talk about Knowledge Graph Question Answering on WAIC 2022. slides
    8. 2021: Invited Talk about Neural-Symbolic Reasoning on Knowledge Graphs on CCKS 2021. slides
    9. 2021: Invited Talk about Knowledge Graph Question Answering on CNCC 2021. slides
    10. 2021: Invited Talk about Graph Self-supervised Learning on CCAI 2021. slides
    11. 2021: Invited Talk about Neural-Symbolic Reasoning on Knowledge Graphs at BAAI Seminar. slides
    12. 2021: Invited Talk about Graph Self-supervised Learning at School of Computer Science & Technology, HUST.
    13. 2020: Invited Talk at NLPCC 2020 Student Workshop. slides
    14. 2019: Invited Talk at Sino-German International Seminar. slides
    15. 2018: Invited Talk at ACML 2018 Workshop on Machine Learning in Education. slides

    Professional Services

    Journal Editors:

    1. Associate Editor of IEEE TBD, 2023--
    2. Associate Editor of AI OPEN, 2020--

    Conference PC members:

    1. 2024: KDD, WWW, AAAI, ARR
    2. 2023: KDD, WWW (SPC), AAAI, ECML/PKDD (SPC)
    3. 2022: KDD, WWW, IJCAI, AAAI, WSDM
    4. 2021: KDD, WWW, IJCAI, ECML/PKDD (SPC)
    5. 2020: KDD, WWW, IJCAI, ECML/PKDD

    Journal Reviewers:

    1. TKDE, IEEE Transactions on Knowledge and Data Engineering
    2. TOIS, ACM Transactions on Information Systems
    3. TPAMI, Transactions on Pattern Analysis and Machine Intelligence
    4. TKDD, ACM Transactions on the Knowledge Discovery from Data
    5. TWEB, ACM Transactions on the Web
    6. TBD, IEEE Transactions on Big Data
    7. JCST, Journal of Computer Science and Technology
    8. JASIST, Journal of the Association for Information Science and Technology
    9. SCIENCE CHINA Information Sciences

    Teaching

    1. 2021- Data Structure and Algorithm
    2. 2020- Deep Learning

    Contact: zhang-jing AT ruc DOT edu DOT cn