Research and Publication

Publications

Gokhale, Tejas, Pratyay Banerjee, Chitta Baral, and Yezhou Yang. “VQA-LOL: Visual question answering under the lens of logic.” In European Conference on Computer Vision (2020).

Wang, Zhe, Zhiyuan Fang, Jun Wang, and Yezhou Yang. “ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language.” In European Conference on Computer Vision (2020).

Bajestani, Mohammad Farhadi, and Yezhou Yang. “Tkd: Temporal knowledge distillation for active perception.” In The IEEE Winter Conference on Applications of Computer Vision, pp. 953-962. 2020.

Gunasekar, Kausic, Qiang Qiu, and Yezhou Yang. “Low to High Dimensional Modality Hallucination Using Aggregated Fields of View.” IEEE Robotics and Automation Letters 5, no. 2 (2020): 1983-1990.

Ye, Xin, Zhe Lin, Joon-Young Lee, Jianming Zhang, Shibin Zheng, and Yezhou Yang. “Gaple: Generalizable approaching policy learning for robotic object searching in indoor environment.” IEEE Robotics and Automation Letters 4, no. 4 (2019): 4003-4010.

Ye, Xin, Zhe Lin, and Yezhou Yang. “Robot learning of manipulation activities with overall planning through precedence graph.” Robotics and Autonomous Systems 116 (2019): 126-135.

Fang, Zhiyuan, Shu Kong, Charless Fowlkes, and Yezhou Yang. “Modularized Textual Grounding for Counterfactual Resilience.” IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2019).

Aditya, Somak, Yezhou Yang, and Chitta Baral. “Integrating knowledge and reasoning in image understanding.” In Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 6252-6259. AAAI Press, 2019.

Farhadi, Mohammad, Mehdi Ghasemi, and Yezhou Yang. “A novel design of adaptive and hierarchical convolutional neural networks using partial reconfiguration on fpga.” In 2019 IEEE High Performance Extreme Computing Conference (HPEC), pp. 1-7. IEEE, 2019.

Ren, Yi, Steven Elliott, Yiwei Wang, Yezhou Yang, and Wenlong Zhang. “How Shall I Drive? Interaction Modeling and Motion Planning towards Empathetic and Socially-Graceful Driving.” IEEE International Conference on Robotics and Automation (ICRA) (2019).

Aditya, Somak, Rudra Saha, Yezhou Yang, and Chitta Baral. “Spatial Knowledge Distillation to aid Visual Reasoning.” In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 227-235. IEEE, 2019.

Xin Ye, Zhe Lin, Haoxiang Li, Shibin Zheng and Yezhou Yang. Active Object Perceiver: Recognition-guided Policy Learning for Object Searching on Mobile Robots, the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018

Izadyyazdanabadi, Mohammadhassan and Belykh, Evgenii and Mooney, Michael and Martirosyan, Nikolay and Eschbacher, Jennifer and Nakaji, Peter and Preul, Mark C and Yang, Yezhou. Convolutional Neural Networks: Ensemble Modeling, Fine-Tuning and Unsupervised Semantic Localization for Neurosurgical CLE Images, Journal of Visual Communication and Image Representation (JVCI), Vol. 50, Page 10-20, 2018

Somak Aditya, Chitta Baral, Yezhou Yang, Cornelia Fermuller, Yiannis Aloimonos. Image Understanding using vision and reasoning through Scene Description Graph., Journal of Computer Vision and Image Understanding (CVIU) (2018)

Izadyyazdanabadi, Mohammadhassan, Evgenii Belykh, Michael Mooney, Jennifer Eschbacher, Peter Nakaji, Yezhou Yang, and Mark Preul. ”Prospects for Theranostics in Neurosurgical Imaging: Empowering Confocal Laser Endomicroscopy Diagnostics via Deep Learning.” Frontiers in Oncology 8 (2018): 240.

Somak Aditya, Yezhou Yang, Chitta Baral, Yiannis Aloimonos. Combining Knowledge and Reasoning through Probabilistic Soft Logic for Image Puzzle Solving, Conference on Uncertainty in Artificial Intelligence (UAI), 2018

Mohammadhassan Izadyyazdanabadi1, Evgenii Belykh, Claudio Cavallo, Xiao-chun Zhao, Sirin Gandhi, Leandro Borba Moreira, Jennifer Eschbacher, Peter Nakaji, Mark C. Preul and Yang, Yezhou. Weakly-Supervised Learning-Based Feature Localization in Confocal Laser Endomicroscopy Glioma Images., 21st International Conference On Medical Image Computing & Computer Assisted Intervention (MICCAI) 2018

Zunlei Feng, Zhenyun Yu, Yezhou Yang, Yongcheng Jing, Junxiao Jiang, and Mingli Song. ”Interpretable Partitioned Embedding for Customized Multi-item Fashion Outfit Composition.” In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, pp. 143-151. ACM, 2018.

Jie Song, Chengchao Shen, Yezhou Yang, Yang Liu, and Mingli Song. Transductive Unbiased Embedding for Zero-Shot Learning. IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) 2018.

Simon Stepputtis, Yezhou Yang, Heni Ben Amor. Extrinsic Dexterity through Active Slip Control using Deep Predictive Models, IEEE International Conference on Robotics and Automation (ICRA) 2018.

Somak Aditya, Yezhou Yang, Chitta Baral. Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering, the Thirty-Second AAAI Conference on Artificial Intelligence 2018.

Somak Aditya, Yezhou Yang, Chitta Baral, Yiannis Aloimonos, Cornelia Fermüller. Image Understanding using Vision and Reasoning through Scene Description Graph. Computer Vision and Image Understanding (CVIU) Dec 2017 . 2018.
Paper Poster Project

Somak Aditya, Yezhou Yang and Chitta Baral. Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering. The Thirty-Second AAAI Conference on Artificial Intelligence (AAAI-18). 2018.
Paper Poster Project

Wenlong Zhang, Yezhou Yang and Yi Ren. Towards Understanding Human Decisions in Human-Robot Interactions. ASME 2017 Dynamic Systems and Control Conference, 2017 (DSCC 17). 2017.
Paper Poster Project

Xin Ye*, Yiwei Wang*, Yezhou Yang and Wenlong Zhang. Collision-free Trajectory Planning in Human-robot Interaction through Hand Movement Prediction from Vision. The 2017 IEEE-RAS International Conference on Humanoid Robots (HUMANOIDS). 2017.
Paper Poster ProjectGokhale, Tejas, Pratyay Banerjee, Chitta Baral, and Yezhou Yang. “VQA-LOL: Visual question answering under the lens of logic.” In European Conference on Computer Vision (2020).

Wang, Zhe, Zhiyuan Fang, Jun Wang, and Yezhou Yang. “ViTAA: Visual-Textual Attributes Alignment in Person Search by Natural Language.” In European Conference on Computer Vision (2020).Bajestani, Mohammad Farhadi, and Yezhou Yang. “Tkd: Temporal knowledge distillation for active perception.” In The IEEE Winter Conference on Applications of Computer Vision, pp. 953-962. 2020.Gunasekar, Kausic, Qiang Qiu, and Yezhou Yang. “Low to High Dimensional Modality Hallucination Using Aggregated Fields of View.” IEEE Robotics and Automation Letters 5, no. 2 (2020): 1983-1990.Ye, Xin, Zhe Lin, Joon-Young Lee, Jianming Zhang, Shibin Zheng, and Yezhou Yang. “Gaple: Generalizable approaching policy learning for robotic object searching in indoor environment.” IEEE Robotics and Automation Letters 4, no. 4 (2019): 4003-4010.Ye, Xin, Zhe Lin, and Yezhou Yang. “Robot learning of manipulation activities with overall planning through precedence graph.” Robotics and Autonomous Systems 116 (2019): 126-135.Fang, Zhiyuan, Shu Kong, Charless Fowlkes, and Yezhou Yang. “Modularized Textual Grounding for Counterfactual Resilience.” IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) (2019).Aditya, Somak, Yezhou Yang, and Chitta Baral. “Integrating knowledge and reasoning in image understanding.” In Proceedings of the 28th International Joint Conference on Artificial Intelligence, pp. 6252-6259. AAAI Press, 2019.Farhadi, Mohammad, Mehdi Ghasemi, and Yezhou Yang. “A novel design of adaptive and hierarchical convolutional neural networks using partial reconfiguration on fpga.” In 2019 IEEE High Performance Extreme Computing Conference (HPEC), pp. 1-7. IEEE, 2019.Ren, Yi, Steven Elliott, Yiwei Wang, Yezhou Yang, and Wenlong Zhang. “How Shall I Drive? Interaction Modeling and Motion Planning towards Empathetic and Socially-Graceful Driving.” IEEE International Conference on Robotics and Automation (ICRA) (2019).Aditya, Somak, Rudra Saha, Yezhou Yang, and Chitta Baral. “Spatial Knowledge Distillation to aid Visual Reasoning.” In 2019 IEEE Winter Conference on Applications of Computer Vision (WACV), pp. 227-235. IEEE, 2019.Xin Ye, Zhe Lin, Haoxiang Li, Shibin Zheng and Yezhou Yang. Active Object Perceiver: Recognition-guided Policy Learning for Object Searching on Mobile Robots, the IEEE/RSJ International Conference on Intelligent Robots and Systems, 2018Izadyyazdanabadi, Mohammadhassan and Belykh, Evgenii and Mooney, Michael and Martirosyan, Nikolay and Eschbacher, Jennifer and Nakaji, Peter and Preul, Mark C and Yang, Yezhou. Convolutional Neural Networks: Ensemble Modeling, Fine-Tuning and Unsupervised Semantic Localization for Neurosurgical CLE Images, Journal of Visual Communication and Image Representation (JVCI), Vol. 50, Page 10-20, 2018Somak Aditya, Chitta Baral, Yezhou Yang, Cornelia Fermuller, Yiannis Aloimonos. Image Understanding using vision and reasoning through Scene Description Graph., Journal of Computer Vision and Image Understanding (CVIU) (2018)Izadyyazdanabadi, Mohammadhassan, Evgenii Belykh, Michael Mooney, Jennifer Eschbacher, Peter Nakaji, Yezhou Yang, and Mark Preul. ”Prospects for Theranostics in Neurosurgical Imaging: Empowering Confocal Laser Endomicroscopy Diagnostics via Deep Learning.” Frontiers in Oncology 8 (2018): 240.Somak Aditya, Yezhou Yang, Chitta Baral, Yiannis Aloimonos. Combining Knowledge and Reasoning through Probabilistic Soft Logic for Image Puzzle Solving, Conference on Uncertainty in Artificial Intelligence (UAI), 2018Mohammadhassan Izadyyazdanabadi1, Evgenii Belykh, Claudio Cavallo, Xiao-chun Zhao, Sirin Gandhi, Leandro Borba Moreira, Jennifer Eschbacher, Peter Nakaji, Mark C. Preul and Yang, Yezhou. Weakly-Supervised Learning-Based Feature Localization in Confocal Laser Endomicroscopy Glioma Images., 21st International Conference On Medical Image Computing & Computer Assisted Intervention (MICCAI) 2018Zunlei Feng, Zhenyun Yu, Yezhou Yang, Yongcheng Jing, Junxiao Jiang, and Mingli Song. ”Interpretable Partitioned Embedding for Customized Multi-item Fashion Outfit Composition.” In Proceedings of the 2018 ACM on International Conference on Multimedia Retrieval, pp. 143-151. ACM, 2018.Jie Song, Chengchao Shen, Yezhou Yang, Yang Liu, and Mingli Song. Transductive Unbiased Embedding for Zero-Shot Learning. IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) 2018.Simon Stepputtis, Yezhou Yang, Heni Ben Amor. Extrinsic Dexterity through Active Slip Control using Deep Predictive Models, IEEE International Conference on Robotics and Automation (ICRA) 2018.Somak Aditya, Yezhou Yang, Chitta Baral. Explicit Reasoning over End-to-End Neural Architectures for Visual Question Answering, the Thirty-Second AAAI Conference on Artificial Intelligence 2018.

Wenlong Zhang, Yezhou Yang and Yi Ren. Towards Understanding Human Decisions in Human-Robot Interactions. ASME 2017 Dynamic Systems and Control Conference, 2017 (DSCC 17). 2017.Paper Poster Project

Chengxi Ye, Yezhou Yang, Cornelia Fermüller and Yiannis Aloimonos. What Can I Do Around Here? Deep Functional Scene Understanding for Cognitive Robot. 2017 International Conference on Robotics and Automation (ICRA) . 2017.
Paper Poster Project

Wentao Luan, Yezhou Yang, Cornelia Fermüller and John Baras. Fast Task-Specific Target Detection via Graph Based Constraints Representation and Checking. 2017 International Conference on Robotics and Automation (ICRA) . 2017.
Paper Poster Project

Eren Erdal Aksoy, Ekaterina Ovchinnikova, Adil Orhan, Yezhou Yang and Tamim Asfour. Unsupervised Linking of Visual Features to Textual Descriptions in Long Manipulation Activities. 2017 International Conference on Robotics and Automation (ICRA) and Robotics and Automation Letters (RA-L) . 2017.
Paper Poster Project

Cornelia Fermüller, Fang Wang, Yezhou Yang, Konstantinos Zampogiannis, Yi Zhang, Francisco Barranco, Michael Pfeiffer. Prediction of Manipulation Actions. International Journal on Computer Vision (IJCV) . 2017.
Paper Poster Project

Publications before 2016 bib

Computer Vision | Robotics | AI, Computational Linguistics and Cognitive Systems | Other Publications

Computer Vision

Wentao Luan, Yezhou Yang, Cornelia Fermüller and John Baras. Reliable Attribute-Based Object Recognition Using High Predictive Value Classifiers. 2016 the 14th European Conference on Computer Vision (ECCV) . 2016.
Paper Poster Project

Yezhou Yang, Cornelia Fermüller, Yi Li and Yiannis Aloimonos. Grasp Type Revisited: A Modern Perspective on A Classical Feature for Vision. 2015 IEEE International Conference on Computer Vision and Pattern Recognition (CVPR) . 2015.
Paper Poster Project

Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos. Detection of Manipulation Action Consequences (MAC). IEEE International Conference on Computer Vision and Pattern Recognition, CVPR . 2013.Paper Poster Dataset ROS package

Xiaodong Yu, Ching Lik Teo, Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos. Action Attribute Detection from Sports Videos with Contextual Constraints.British Machine Vision Conference (BMVC). 2013.Paper Poster Dataset

Xiaodong Yu, Cornelia Fermüller, Ching L. Teo, Yezhou Yang, Yiannis Aloimonos. Active Scene Recognition with Vision and Language. International Conference on Computer Vision, ICCV. 2011.Paper Poster

Yezhou Yang, Mingli Song, N. Li, J. Bu, C. Chen; What is the Chance of Happening: a New Way to Predict Where People Look. The 11th European Conference on Computer Vision, ECCV. 2010.Paper

Robotics

Ren Mao, John Baras, Yezhou Yang, Cornelia Fermüller. Co-active Learning to Adapt Humanoid Movement for Manipulation. 2016 IEEE-RAS International Conference on Humanoid Robots (Humanoids) . 2016.
Paper Project

Konstantinos Zampogiannis, Yezhou Yang, Cornelia Fermüller and Yiannis Aloimonos. Learning the Spatial Semantics of Manipulation Actions through Preposition Grounding. 2015 IEEE International Conference on Robotics and Automation (ICRA) . 2015.
Paper Poster Project

Yezhou Yang, Anupam Guha, Cornelia Fermüller, Yiannis Aloimonos. Manipulation Action Tree Bank: A Knowledge Resource for Humanoids. IEEE-RAS International Conference on Humanoid Robots, Humanoids. 2014.Paper Slides Salad Making Tree Bank

Ren Mao, Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos. Learning Hand Movements from Markerless Demonstrations for Humanoid Tasks. IEEE-RAS International Conference on Humanoid Robots, Humanoids. 2014.Paper Slides

Yezhou Yang, Ching L. Teo, Cornelia Fermüller, Yiannis Aloimonos. Robots with Language: Multi-Label Visual Recognition Using NLP.IEEE International Conference on Robotics and Automation, ICRA. 2013.Paper Slides

Anupam Guha, Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos. Minimalist Plans for Interpreting Manipulation Actions.IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2013.Paper Slides

Douglas Summers-stay, Ching L. Teo, Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos. Using a Minimal Action Grammar for Activity Understanding in the Real World. IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS. 2012.Paper slides Dataset

Ching L. Teo, Yezhou Yang, Hal Daumé III, Cornelia Fermüller, Yiannis Aloimonos. Towards a Watson That Sees: Language-Guided Action Recognition for Robots. IEEE International Conference on Robotics and Automation, ICRA. 2012.Paper Slides Dataset

AI, Computational Linguistics and Cognitive Systems

Somak Aditya, Chitta Baral, Yezhou Yang, Cornelia Fermuller and Yiannis Aloimonos. DeepIU: An Architecture for Image Understanding. Advances in Cognitive Systems (ACS) 4 (2016). 2016.
Paper Poster Project

Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos and Eren Erdal Aksoy. Learning the Semantics of Manipulation Action. The 53rd Annual Meeting of the Association for Computational Linguistics (ACL) . 2015.
Paper Poster Project

Yezhou Yang, Yi Li, Cornelia Fermüller, Yiannis Aloimonos. Robot Learning Manipulation Action Plans by “Watching” Unconstrained Videos From the World Wide Web. the Twenty-Ninth AAAI Conference on Artificial Intelligence . 2015.
Abstract will appear at the Autonomously Learning Robots workshop at NIPS Conference. 2014.Paper Poster Baxter Demo

Somak Aditya, Yezhou Yang, Chitta Baral, Cornelia Fermüller, Yiannis Aloimonos. Visual common-sense for scene understanding using perception, semantic parsing and reasoning. Common-sense 2015, AAAI 2015 Spring Symposium .Paper Talk

Yezhou Yang, Anupam Guha, Cornelia Fermüller, Yiannis Aloimonos. A Cognitive System for Understanding Human Manipulation Actions. Advances in Cognitive Systems . (ISSN 2324-8416) 2014.
Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos. Interpreting Manipulation Actions: a Cognitive Approach. Vision Meets Cognition Workshop, CVPR. 2014.Paper Poster

Yezhou Yang, Ching L. Teo, Hal Daumé III and Yiannis Aloimonos. Corpus-Guided Sentence Generation of Natural Images. Conference on Empirical Methods in Natural Language Processing, EMNLP. 2011.Paper Slides Results

Ching L. Teo, Yezhou Yang, Hal Daumé III, Cornelia Fermüller and Yiannis Aloimonos. A Corpus-Guided Framework for Robotic Visual Perception. AAAI Workshop on Language-Action Tools for Cognitive Artificial Agents. 2011.Paper Slides Dataset

Chengxi Ye, Chen Zhao, Yezhou Yang, Cornelia Fermuller and Yiannis Aloimonos. LightNet: A Versatile, Standalone Matlab-based Environment for Deep Learning. The Open Source Software Competition, ACMMM. 2016.Paper Slides Github

Ching L. Teo, Yezhou Yang, Cornelia Fermüller, Yiannis Aloimonos. Synergistic Methods for using Language in Robotics. Performance Metrics for Intelligent Systems Workshop, PerMIS. 2012.Paper Slides Dataset

Yezhou Yang, Mingli Song, J. Bu, C. Chen, C. Jin. Color to Gray: Attention Preservation. The 4th Pacific-Rim Symposium on Image and Video Technology, PSIVT. 2010.Paper

Yezhou Yang, Mingli Song, N. Li, J. Bu, C. Chen; Visual attention analysis by pseudo gravitational field. ACM International Conference on Multimedia, ACMMM. 2009.Paper

Professional services

Workshop Co-organizer 2nd Workshop on Semantic Policy and Action Representations for Autonomous Robots (SPAR) September 24th 2017 as part of the IROS 2017 conference in Vancouver, Canada.
Workshop Co-organizer: “Deep Learning for Autonomous Robots (DLAR)” workshop at the 2016 Robotics: Science and Systems Conference. (Workshop page).
Workshop Co-organizer: “Semantic Policy and Action Representations (SPAR) for Autonomous Robots” workshop at the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2015). (Workshop page).
Technical Program Committee or Reviewer: AAAI 2017, RSS 2016, ICPR 2016, IJCAI 2016, ICRA 2016, ICRA 2015, IROS 2014, ICME 2016, ICME 2014, ICME 2013, IROS 2013, PSIVT 2013,
Journal reviewer: International Jornal of Computer Vision (IJCV) The IEEE Transactions on Robotics (T-RO) Computer Vision and Image Understanding (CVIU), The Visual Computer, Information Sciences, Neurocomputing, Image and Vision Computing, Journal of Visual Communication and Image Representation.
Co-organizer: University of Maryland Robotics Graduate Student Seminar: Seminar Page.