Year | Conference Paper |
2025 | Pengcheng Jiang, Cao Xiao, Minhao Jiang, Parminder Bhatia, Taha Kass-Hout, Jimeng Sun, Jiawei Han, “Reasoning-Enhanced Healthcare Predictions with Knowledge Graph Community Retrieval”, in Proc. of 2025 Int. Conf. on Learning Representations (ICLR’25), April 2025 |
2025 | Bowen Jin, Jinsung Yoon, Jiawei Han, Sercan O. Arik, “Long-Context LLMs Meet RAG: Overcoming Challenges for Long Inputs in RAG”, in Proc. of 2025 Int. Conf. on Learning Representations (ICLR’25), April 2025 |
2025 | Ming Zhong, Aston Zhang, Xuewei Wang, Rui Hou, Wenhan Xiong, Chenguang Zhu, Zhengxing Chen, Liang Tan, Chloe Bi, Mike Lewis, Sravya Popuri, Sharan Narang, Melanie Kambadur, Dhruv Mahajan, Sergey Edunov, Jiawei Han, Laurens van der Maaten, “Law of the Weakest Link: Cross Capabilities of Large Language Models”, in Proc. of 2025 Int. Conf. on Learning Representations (ICLR’25), April 2025 |
2025 | Yu Zhang, Yanzhen Shen, SeongKu Kang, Xiusi Chen, Bowen Jin, Jiawei Han, “Chain-of-Factors Paper-Reviewer Matching”, in Proc. The Web Conference 2025 (WWW’25), April 2025 |
2025 | Yunyi Zhang, Ruozhen Yang, Xueqiang Xu, Rui Li, Jinfeng Xiao, Jiaming Shen, Jiawei Han, “TELEClass: Taxonomy Enrichment and LLM-Enhanced Hierarchical Text Classification with Minimal Supervision”, in Proc. The Web Conference 2025 (WWW’25), April 2025 |
2025 | Yizhu Jiao, Siru Ouyang, Ming Zhong, Yunyi Zhang, Linyi Ding, Sizhe Zhou, Jiawei Han, “Retrieval and Structuring Augmented Generation with Large Language Models for Web Applications”, (Conf. Tutorial), 2025 The Web Conference (WWW’25), April 2025 |
2025 | Pengcheng Jiang, Cao Xiao, Tianfan Fu, Parminder Bhatia, Taha Kass-Hout, Jimeng Sun, Jiawei Han, “Bi-level Contrastive Learning for Knowledge-Enhanced Molecule Representations”, Proc. of 2025 AAAI Conf. on Artificial Intelligence (AAAI’25), Feb. 2025 |
2025 | SeongKu Kang, Bowen Jin, Wonbin Kweon, Yu Zhang, Dongha Lee, Jiawei Han, Hwanjo Yu, “Improving Scientific Document Retrieval with Concept Coverage-based Query Set Generation”, in Proc. 2025 ACM Int. Conf. on Web Search and Data Mining (WSDM’25), March 2025. https://doi.org/10.1145/3701551.3703544 |
2025 | Fu, C., X. Li, B. Olson, H. Ji, and S. Ji, Fragment and Geometry Aware Tokenization of Molecules for Structure-Based Drug Design Using Language Models. 2025. Proc. The Thirteenth International Conference on Learning Representations (ICLR2025). https://openreview.net/forum?id=mMhZS7qt0U |
2025 | Arneson, K., L. Fu, L. Gatzke. Toward More Usable, Reproducible, and Sustainable Scientific Software: The Impact of User-Centered Design in Research Software Development. 2025. Proc. Platform for Advanced Scientific Computing. |
2025 | Nguyen, T., K.-H. Huang, G. Liu, M.D. Burke, Y. Diao, and H. Ji, FARM: Functional Group-Aware Representations for Small Molecules. , 2025. Proc. NAACL2025 Workshop on AI and Scientific Discovery: Directions and Opportunities. https://doi.org/10.48550/arXiv.2410.02082 |
2024 | Zhu, K., B.-W. Huang, B. Jin, Y. Jiao, M. Zhong, K. Chang, S.-D. Lin, and J. Han. Investigating Instruction Tuning Large Language Models on Graphs. in Conference on Language Modeling. 2024. https://doi.org/10.48550/arXiv.2408.05457 |
2024 | Zhou, S., Y. Meng, B. Jin, and J. Han. Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction. in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.emnlp-main.747 |
2024 | Zhong, X., Y. Du, S. Ouyang, M. Zhong, T. Luo, Q. Ho, H. Peng, H. Ji, and J. Han. Actionie: Action extraction from scientific literature with programming languages. in Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.acl-long.683 |
2024 | Zhao, J., C. Zhang, and Y. Luo. Contrastive fitness learning: Reprogramming protein language models for low-n learning of protein fitness landscape. in International Conference on Research in Computational Molecular Biology. 2024. Springer Nature Switzerland Cham. https://doi.org/10.1007/978-1-0716-3989-4_55 |
2024 | Zhang, Y., M. Zhong, S. Ouyang, Y. Jiao, S. Zhou, L. Ding, and J. Han. Automated Mining of Structured Knowledge from Text in the Era of Large Language Models. in Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024. https://doi.org/10.1145/3637528.3671469 |
2024 | Zhang, Y., X. Chen, B. Jin, S. Wang, S. Ji, W. Wang, and J. Han. A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery. in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.emnlp-main.498 |
2024 | Zeng, Q., M. Sidhu, H.P. Chan, L. Wang, and H. Ji. Scientific Opinion Summarization: Paper Meta-review Generation Dataset, Methods, and Evaluation. in 1st AI4Research Workshop. 2024. International Joint Conferences on Artificial Intelligence Organization. https://doi.org/10.48550/arXiv.2305.14647 |
2024 | Yan, K., X. Li, H. Ling, K. Ashen, C. Edwards, R. Arróyave, M. Zitnik, H. Ji, X. Qian, and X. Qian. Invariant Tokenization of Crystalline Materials for Language Model Enabled Generation. in Advances in Neural Information Processing Systems. 2024. https://proceedings.neurips.cc/paper_files/paper/2024/file/e23133d34964a0a09f6d076fc4b922a4-Paper-Conference.pdf |
2024 | Xiao, J., L. Ding, J. Barry, M. Elkaref, G. De Mel, and J. Han. ORAG: Ontology-Guided Retrieval-Augmented Generation for Theme-Specific Entity Typing. in Conference on Language Modeling. 2024. https://openreview.net/forum?id=cKBmZ2PZ6c |
2024 | Wang, Q., Z. Zhang, H. Li, X. Liu, J. Han, H. Zhao, and H. Ji. Chem-FINESE: Validating fine-grained few-shot entity extraction through text reconstruction. in Findings of the Association for Computational Linguistics: EACL 2024. 2024. Association for Computational Linguistics. https://aclanthology.org/2024.findings-eacl.1/ |
2024 | Wang, Q., C. Edwards, H. Ji, and T. Hope. Towards a human-computer collaborative scientific paper lifecycle: A pilot study and hands-on tutorial. in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024): Tutorial Summaries. 2024. ELRA and ICCL. https://aclanthology.org/2024.lrec-tutorials.10/ |
2024 | Wang, Q., D. Downey, H. Ji, and T. Hope. Scimon: Scientific inspiration machines optimized for novelty. in Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.acl-long.18 |
2024 | Roy, S.G. and J. Han. ILCiteR: Evidence-grounded Interpretable Local Citation Recommendation. in Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024). 2024. ELRA and ICCL. https://aclanthology.org/2024.lrec-main.757/ |
2024 | Reddy, R.G., J. Doo, Y. Xu, M.A. Sultan, D. Swain, A. Sil, and H. Ji. FIRST: Faster Improved Listwise Reranking with Single Token Decoding. in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.emnlp-main.491 |
2024 | Ouyang, S., S. Wang, M. Jiang, M. Zhong, D. Yu, J. Han, and Y. Shen. Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation. in Findings of the Association for Computational Linguistics: EMNLP 2024. 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.findings-emnlp.767 |
2024 | Ouyang, S., J. Huang, P. Pillai, Y. Zhang, Y. Zhang, and J. Han. Ontology enrichment for effective fine-grained entity typing. in Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024. https://doi.org/10.1145/3637528.3671857 |
2024 | Nguyen, T., T. Torres-Flores, C. Hwang, C. Edwards, Y. Diao, and H. Ji. GLaD: Synergizing Molecular Graphs and Language Descriptors for Enhanced Power Conversion Efficiency Prediction in Organic Photovoltaic Devices. in Proceedings of the 33rd ACM International Conference on Information and Knowledge Management. 2024. https://doi.org/10.1145/3627673.3680103 |
2024 | Liu, H., Q. Wang, P. Karisani, and H. Ji. Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences. in Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers). 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.naacl-long.1 |
2024 | Komarlu, T., M. Jiang, X. Wang, and J. Han. OntoType: Ontology-Guided and Pre-Trained Language Model Assisted Fine-Grained Entity Typing. in Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2024. https://doi.org/10.1145/3637528 |
2024 | Kang, S., Y. Zhang, P. Jiang, D. Lee, J. Han, and H. Yu. Taxonomy-guided Semantic Indexing for Academic Paper Search. in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.emnlp-main.407 |
2024 | Kang, S., S. Agarwal, B. Jin, D. Lee, H. Yu, and J. Han. Improving retrieval in theme-specific applications using a corpus topical taxonomy. in Proceedings of the ACM Web Conference 2024. https://doi.org/10.1145/3589334.3645512 |
2024 | Jin, B., Y. Zhang, S. Li, and J. Han. Bridging Text Data and Graph Data: Towards Semantics and Structure-aware Knowledge Discovery. in Proceedings of the 17th ACM International Conference on Web Search and Data Mining. 2024. https://doi.org/10.1145/3616855 |
2024 | Jin, B., C. Xie, J. Zhang, K.K. Roy, Y. Zhang, Z. Li, R. Li, X. Tang, S. Wang, Y. Meng, and J. Han. Graph Chain-of-Thought: Augmenting Large Language Models by Reasoning on Graphs. in Findings of the Association for Computational Linguistics: ACL 2024. 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.findings-acl.11 |
2024 | Jin, B., Z. Pang, B. Guo, Y.-X. Wang, J. You, and J. Han. InstructG2I: Synthesizing Images from Multimodal Attributed Graphs. in Annual Conference on Neural Information Processing Systems. 2024. https://doi.org/10.48550/arXiv.2410.07157 |
2024 | Jiao, Y., S. Li, S. Zhou, H. Ji, and J. Han. TEXT2DB: Integration-Aware Information Extraction with Large Language Model Agents. in Findings of the Association for Computational Linguistics ACL 2024. 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.findings-acl.12 |
2024 | Edwards, C., Q. Wang, L. Zhao, and H. Ji. L+M-24: Building a Dataset for Language+Molecules @ ACL 2024. in Proceedings of the 1st Workshop on Language + Molecules (L+M 2024). 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.langmol-1.1 |
2024 | Edwards, C., Q. Wang, and H. Ji. Language + Molecules. in Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: Tutorial Abstracts. 2024. Association for Computational Linguistics. https://aclanthology.org/2024.eacl-tutorials.3/ |
2024 | Edwards, C., A. Naik, T. Khot, M.D. Burke, H. Ji, and T. Hope. SynerGPT: In-Context Learning for Personalized Drug Synergy Prediction and Drug Design. in Conference on Language Modeling. 2024. https://doi.org/10.48550/arXiv.2307.11694 |
2024 | Ding, L., J. Xiao, S. Zhou, C. Yang, and J. Han. Topic-Oriented Open Relation Extraction with A Priori Seed Generation. in Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing. 2024. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.emnlp-main.766 |
2024 | Pengfei Yu and Heng Ji. 2024. Information Association for Language Model Updating by Mitigating LM-Logical Discrepancy. In Proceedings of the 28th Conference on Computational Natural Language Learning, pages 117–129, Miami, FL, USA. Association for Computational Linguistics. https://doi.org/10.18653/v1/2024.conll-1.10 |
2024 | Ghaffari, S., E. Saleh, A. Schwing, Y.-X. Wang, M.D. Burke, and S. Sinha. Robust Model-Based Optimization for Challenging Fitness Landscapes. in International Conference on Learning Representations. 2024. https://openreview.net/forum?id=xhEN0kJh4q |
2023 | Zhou, S., S. Ge, J. Shen, and J. Han. Corpus-based relation extraction by identifying and refining relation patterns. in Joint European Conference on Machine Learning and Knowledge Discovery in Databases. 2023. Springer Nature Switzerland Cham. https://doi.org/10.1007/978-3-031-43421-1_2 |
2023 | Zhong, M., S. Ouyang, Y. Jiao, P. Kargupta, L. Luo, Y. Shen, B. Zhou, X. Zhong, X. Liu, H. Li, J. Xiao, M. Jiang, X. Wang, H. Ji, M.D. Burke, H. Zhao and J. Han. Reaction miner: An integrated system for chemical reaction extraction from textual data. in Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.emnlp-demo.36 |
2023 | Zhong, M., S. Ouyang, M. Jiang, V. Hu, Y. Jiao, X. Wang, and J. Han. ReactIE: Enhancing Chemical Reaction Extraction with Weak Supervision. in Findings of the Association for Computational Linguistics: ACL 2023. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-acl.767 |
2023 | Zhao, L., C. Edwards, and H. Ji. What a Scientific Language Model Knows and Doesn’t Know about Chemistry. in NeurIPS 2023 AI for Science Workshop. 2023. https://openreview.net/forum?id=hSmn7BQZ2v¬eId=Nr11sAV2kF |
2023 | Zhang, Y., Y. Zhang, M. Michalski, Y. Jiang, Y. Meng, and J. Han. Effective seed-guided topic discovery by integrating multiple types of contexts. in Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining. 2023. https://doi.org/10.1145/3539597.3570475 |
2023 | Zhang, Y., Y. Zhang, and J. Han. Mining Structures from Massive Texts by Exploring the Power of Pre-trained Language Models. in EDBT. 2023. https://doi.org/10.48786/EDBT.2023.81 |
2023 | Zhang, Y., B. Jin, Q. Zhu, Y. Meng, and J. Han. The effect of metadata on scientific literature tagging: A cross-field cross-model study. in Proceedings of the ACM Web Conference 2023. 2023. https://doi.org/10.1145/3543507.3583354 |
2023 | Zhang, Y., B. Jin, X. Chen, Y. Shen, Y. Zhang, Y. Meng, and J. Han. Weakly supervised multi-label classification of full-text scientific papers. in Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2023. https://doi.org/10.1145/3580305.3599544 |
2023 | Zhang, Y., M. Jiang, Y. Meng, Y. Zhang, and J. Han. Pieclass: Weakly-supervised text classification with prompting and noise-robust iterative ensemble training. in Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.emnlp-main.780 |
2023 | Yoon, S., Y. Meng, D. Lee, and J. Han. SCStory: Self-supervised and Continual Online Story Discovery. in Proceedings of the ACM Web Conference 2023. 2023. https://doi.org/10.1145/3543507.3583507 |
2023 | Yoon, S., D. Lee, Y. Zhang, and J. Han. Unsupervised story discovery from continuous news streams via scalable thematic embedding. in Proceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval. 2023. https://doi.org/10.1145/3539618.3591782 |
2023 | Yoon, S., H.P. Chan, and J. Han. Pdsum: Prototype-driven continuous summarization of evolving multi-document sets stream. in Proceedings of the ACM Web Conference 2023. 2023. https://doi.org/10.1145/3543507.3583371 |
2023 | Sprueill, H.W., C. Edwards, M.V. Olarte, U. Sanyal, H. Ji, and S. Choudhury. Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst Design. in Findings of the Association for Computational Linguistics: EMNLP 2023. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-emnlp.560 |
2023 | Shah, A.K. and R. Zanibbi. Line-of-sight with graph attention parser (LGAP) for math formulas. in International Conference on Document Analysis and Recognition. 2023. Springer Nature Switzerland Cham. https://doi.org/10.1007/978-3-031-41734-4_25 |
2023 | Ouyang, S., S. Wang, Y. Liu, M. Zhong, Y. Jiao, D. Iter, R. Pryzant, C. Zhu, H. Ji, and J. Han. The Shifted and The Overlooked: A Task-oriented Investigation of User-GPT Interactions. in Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.emnlp-main.146 |
2023 | Miao, S., Y. Luo, M. Liu, and P. Li. Interpretable Geometric Deep Learning via Learnable Randomness Injection. in International Conference on Learning Representation. 2023. https://doi.org/10.48550/arXiv.2210.16966 |
2023 | Meng, Y., M. Michalski, J. Huang, Y. Zhang, T. Abdelzaher, and J. Han. Tuning language models as training data generators for augmentation-enhanced few-shot learning. in International Conference on Machine Learning. 2023. PMLR. https://dl.acm.org/doi/10.5555/3618408.3619426 |
2023 | Meng, Y., J. Huang, Y. Zhang, Y. Zhang, and J. Han. Pretrained Language Representations for Text Understanding: A Weakly-Supervised Perspective. in Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2023. https://doi.org/10.1145/3580305 |
2023 | Luo, J. and Y. Luo. Contrastive learning of protein representations with graph neural networks for structural and functional annotations. in Biocomputing. 2023. World Scientific. https://doi.org/10.1142/9789811270611_0011 |
2023 | Jin, X., B. Vinzamuri, S. Venkatapathy, H. Ji, and P. Natarajan. Adversarial robustness for large language NER models using disentanglement and word attributions. in Findings of the Association for Computational Linguistics: EMNLP 2023. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-emnlp.830 |
2023 | Jin, B., Y. Zhang, Q. Zhu, and J. Han. Heterformer: Transformer-based deep node representation learning on heterogeneous text-rich networks. in Proceedings of the 29th ACM SIGKDD conference on knowledge discovery and data mining. 2023. https://doi.org/10.1145/3580305 |
2023 | Jin, B., Y. Zhang, Y. Meng, and J. Han. Edgeformers: Graph-empowered transformers for representation learning on textual-edge networks. in International Conference on Learning Representation. 2023. https://doi.org/10.48550/arXiv.2302.11050 |
2023 | Jin, B., W. Zhang, Y. Zhang, Y. Meng, X. Zhang, Q. Zhu, and J. Han. Patton: Language Model Pretraining on Text-Rich Networks. in Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.acl-long.387 |
2023 | Jiao, Y., M. Zhong, J. Shen, Y. Zhang, C. Zhang, and J. Han. Unsupervised event chain mining from multiple documents. in Proceedings of the ACM Web Conference 2023. 2023. https://doi.org/10.1145/3543507 |
2023 | Jiao, Y., M. Zhong, S. Li, R. Zhao, S. Ouyang, H. Ji, and J. Han. Instruct and Extract: Instruction Tuning for On-Demand Information Extraction. in Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.emnlp-main.620 |
2023 | Jiang, P., S. Agarwal, B. Jin, X. Wang, J. Sun, and J. Han. Text Augmented Open Knowledge Graph Completion via Pre-Trained Language Models. in Findings of the Association for Computational Linguistics: ACL 2023. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-acl.709 |
2023 | Guan, J., W.W. Qian, X. Peng, Y. Su, J. Peng, and J. Ma. 3D Equivariant Diffusion for Target-Aware Molecule Generation and Affinity Prediction. in International Conference on Learning Representation. 2023. https://doi.org/10.48550/arXiv.2303.03543 |
2023 | Ge, S., J. Huang, Y. Meng, and J. Han. FineSum: Target-Oriented, Fine-Grained Opinion Summarization. in Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining. 2023. https://doi.org/10.1145/3539597.3570397 |
2023 | Chan, H.P., Q. Zeng, and H. Ji. Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarization. in Findings of the Association for Computational Linguistics: ACL 2023. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-acl.402 |
2023 | Balepur, N., S. Agarwal, K.V. Ramanan, S. Yoon, D. Yang, and J. Han. DynaMiTE: Discovering explosive topic evolutions with user guidance. in Findings of the Association for Computational Linguistics: ACL 2023. 2023. Association for Computational Linguistics. https://doi.org/10.18653/v1/2023.findings-acl.14 |
2022 | Zhong, M., Y. Liu, D. Yin, Y. Mao, Y. Jiao, P. Liu, C. Zhu, H. Ji, and J. Han. Towards a Unified Multi-Dimensional Evaluator for Text Generation. in Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022. Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.emnlp-main.131 |
2022 | Zhong, M., Y. Liu, S. Ge, Y. Mao, Y. Jiao, X. Zhang, Y. Xu, C. Zhu, M. Zeng, and J. Han. Unsupervised Multi-Granularity Summarization. in Findings of the Association for Computational Linguistics: EMNLP 2022. 2022. Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.findings-emnlp.366 |
2022 | Zhang, Y., Y. Meng, X. Wang, S. Wang, and J. Han. Seed-Guided Topic Discovery with Out-of-Vocabulary Seeds. in Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. 2022. Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.naacl-main.21 |
2022 | Zhang, Y., F. Guo, J. Shen, and J. Han. Unsupervised key event detection from massive text corpora. in Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining. 2022. https://doi.org/10.1145/3534678.3539395 |
2022 | Zhang, Y., S. Garg, Y. Meng, X. Chen, and J. Han. MotifClass: Weakly Supervised Text Classification with Higher-order Metadata Information. in Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. 2022. https://doi.org/10.1145/3488560.3498384 |
2022 | Wang, X., H. Wang, H. Ji, and J. Han. New frontiers of scientific text mining: tasks, data, and tools. in Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2022. https://doi.org/10.1145/3534678.3542606 |
2022 | Wang, X., H. Wang, H. Ji, and J. Han. Modern natural language processing techniques for scientific web mining: tasks, data, and tools. in Proceedings of the ACM Web Conference 2022. 2022. https://blender.cs.illinois.edu/paper/wwwtutorial2022.pdf |
2022 | Wang, X., V. Hu, M. Jiang, Y. Zhang, J. Xiao, D.C. Loving, H. Ji, M. Burke, and J. Han. REACTCLASS: cross-modal supervision for subword-guided reactant entity classification. in 2022 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). 2022. IEEE. https://doi.ieeecomputersociety.org/10.1109/BIBM55620.2022.9995489 |
2022 | Wang, H., W. Li, X. Jin, K. Cho, H. Ji, J. Han, and M.D. Burke. Chemical-Reaction-Aware Molecule Representation Learning. in International Conference on Learning Representation. 2022. https://openreview.net/forum?id=6sh3pIzKS- |
2022 | Meng, Y., Y. Zhang, J. Huang, Y. Zhang, and J. Han. Topic discovery via latent space clustering of pretrained language model representations. in Proceedings of the ACM Web Conference 2022. 2022. https://doi.org/10.1145/3485447.3512034 |
2022 | Meng, Y., J. Huang, Y. Zhang, and J. Han. Generating training data with language models: Towards zero-shot language understanding. in Advances in Neural Information Processing Systems. 2022. https://proceedings.neurips.cc/paper_files/paper/2022/file/0346c148ba1c21c6b4780a961ea141dc-Paper-Conference.pdf |
2022 | Meng, Y., J. Huang, Y. Zhang, and J. Han. Adapting Pretrained Representations for Text Mining. in Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2022. https://doi.org/10.1145/3534678 |
2022 | Lee, D., J. Shen, S. Lee, S. Yoon, H. Yu, and J. Han. Topic Taxonomy Expansion via Hierarchy-Aware Topic Phrase Generation. in Findings of the Association for Computational Linguistics: EMNLP 2022. 2022. Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.findings-emnlp.122 |
2022 | Lee, D., J. Shen, S. Kang, S. Yoon, J. Han, and H. Yu. Taxocom: Topic taxonomy completion with hierarchical discovery of novel topic clusters. in Proceedings of the ACM Web Conference 2022. 2022. https://doi.org/10.1145/3485447.3512002 |
2022 | Jiao, Y., S. Li, Y. Xie, M. Zhong, H. Ji, and J. Han. Open-Vocabulary Argument Role Prediction For Event Extraction. in Findings of the Association for Computational Linguistics: EMNLP 2022. 2022. Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.findings-emnlp.395 |
2022 | Hwang, C., S. Yi, D. Friday, N.H. Angello, T.C. Torres-Flores, N.E. Jackson, M.D. Burke, C.M Schroeder, and Y. Diao. Autonomous Materials Discovery for Organic Photovoltaics. in AI for Accelerated Materials Design NeurIPS 2022 Workshop. 2022. https://openreview.net/forum?id=RfJOs4EMfjj |
2022 | Huang, J., Y. Meng, and J. Han. Few-shot fine-grained entity typing with automatic label interpretation and instance generation. in Proceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining. 2022. https://doi.org/10.1145/3534678 |
2022 | Guan, J., W.W. Qian, Q. Liu, W.-Y. Ma, J. Ma, and J. Peng. Energy-inspired molecular conformation optimization. in International Conference on Learning Representation. 2022. https://openreview.net/forum?id=7QfLW-XZTl |
2022 | Gu, X., Y. Shen, J. Shen, J. Shang, and J. Han. Phrase-aware Unsupervised Constituency Parsing. in Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 2022. Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.acl-long.444 |
2022 | Edwards, C., T. Lai, K. Ros, G. Honke, K. Cho, and H. Ji. Translation between Molecules and Natural Language. in Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2022. Association for Computational Linguistics. https://doi.org/10.18653/v1/2022.emnlp-main.26 |
2022 | Agarwal, S., R. Sawhney, M. Thakkar, P. Nakov, J. Han, and T. Derr. Think: Temporal hypergraph hyperbolic network. in 2022 IEEE International Conference on Data Mining (ICDM). 2022. IEEE. https://doi.ieeecomputersociety.org/10.1109/ICDM54844.2022.00096 |
2021 | Zhu, Q., C. Yang, Y. Xu, H. Wang, C. Zhang, and J. Han. Transfer learning of graph neural networks with ego-graph information maximization. in Advances in Neural Information Processing Systems. 2021. https://proceedings.neurips.cc/paper_files/paper/2021/file/0dd6049f5fa537d41753be6d37859430-Paper.pdf |
2021 | Zhu, Q., N. Ponomareva, J. Han, and B. Perozzi. Shift-Robust GNNs: Overcoming the Limitations of Localized Graph Training data. in Advances in Neural Information Processing Systems. 2021. https://proceedings.neurips.cc/paper_files/paper/2021/hash/eb55e369affa90f77dd7dc9e2cd33b16-Abstract.html |
2021 | Zhang, Z., N.N. Parulian, H. Ji, A.S. Elsayed, S. Myers, and M. Palmer. Fine-grained Information Extraction from Biomedical Literature based on Knowledge-enriched Abstract Meaning Representation. in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.acl-long.489 |
2021 | Zhang, X., C. Zhang, X.L. Dong, J. Shang, and J. Han. Minimally-supervised structure-rich text categorization via learning on text-rich networks. in Proceedings of the Web Conference 2021. 2021. https://doi.org/10.1145/3442381.3450114 |
2021 | Xie, Y., J. Shen, S. Li, Y. Mao, and J. Han. Eider: Empowering Document-level Relation Extraction with Efficient Evidence Extraction and Inference-stage Fusion. in Findings of the Association for Computational Linguistics: ACL 2022. 2021. https://doi.org/10.18653/v1/2022.findings-acl.23 |
2021 | Wang, X., V. Hu, X. Song, S. Garg, J. Xiao, and J. Han. ChemNER: fine-grained chemistry named entity recognition with ontology-guided distant supervision. in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 2021. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.emnlp-main.424 |
2021 | Sun, C., W. Li, J. Xiao, N.N. Parulian, C. Zhai, and H. Ji. Fine-grained chemical entity typing with multimodal knowledge representation. in 2021 IEEE International Conference on Bioinformatics and Biomedicine (BIBM). 2021. IEEE. https://doi.ieeecomputersociety.org/10.1109/BIBM52615.2021.9669360 |
2021 | Shen, J., Y. Zhang, H. Ji, and J. Han. Corpus-based open-domain event type induction. in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 2021. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.emnlp-main.441 |
2021 | Shah, A.K., A. Dey, and R. Zanibbi. A math formula extraction and evaluation framework for pdf documents. in Document Analysis and Recognition–ICDAR 2021: 16th International Conference, Lausanne, Switzerland, September 5–10, 2021, Proceedings, Part II 16. 2021. Springer International Publishing. https://doi.org/10.1007/978-3-030-86331-9_2 |
2021 | Meng, Y., Y. Zhang, J. Huang, X. Wang, Y. Zhang, H. Ji, and J. Han. Distantly-Supervised Named Entity Recognition with Noise-Robust Learning and Language Model Augmented Self-Training. in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 2021. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.emnlp-main.810 |
2021 | Meng, Y., J. Huang, Y. Zhang, and J. Han. On the Power of Pre-Trained Text Representations: Models and Applications in Text Mining. in Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2021. https://doi.org/10.1145/3447548.3470810 |
2021 | Mao, Y., W. Ma, D. Lei, J. Han, and X. Ren. Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation. in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 2021. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.emnlp-main.413 |
2021 | Lai, T., H. Ji, C. Zhai, and Q.H. Tran. Joint Biomedical Entity and Relation Extraction with Knowledge-Enhanced Collective Inference. in Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 2021. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.acl-long.488 |
2021 | Lai, T., H. Ji, and C. Zhai. BERT might be Overkill: A Tiny but Effective Biomedical Entity Linker based on Residual Convolutional Neural Networks. in Findings of the Association for Computational Linguistics: EMNLP 2021. 2021. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.findings-emnlp.140 |
2021 | Gu, X., Z. Wang, Z. Bi, Y. Meng, L. Liu, J. Han, and J. Shang. Ucphrase: Unsupervised context-aware quality phrase tagging. in Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining. 2021. https://doi.org/10.1145/3447548.3467397 |
2021 | Edwards, C., C. Zhai, and H. Ji. Text2mol: Cross-modal molecule retrieval with natural language queries. in Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 2021. Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.emnlp-main.47 |
2021 | Dey, A. and R. Zanibbi. ScanSSD-XYc: faster detection for math formulas. in Document Analysis and Recognition–ICDAR 2021 Workshops: Lausanne, Switzerland, September 5–10, 2021, Proceedings, Part I 16. 2021. Springer. https://doi.org/10.1007/978-3-030-86198-8_7 |