Bo He - IEEE Xplore Author Profile

Author details

Bo He

Publications

4

Citations

95

Publications by Year

2022-2024

Co-Authors:

Trung Bui
Xuefei Cao
Zhiyu Cheng
Yifei Fan
Young Kyun Jang

Show All Co-Authors (20)

Affiliation

University of Maryland

Meta

Publication Topics

Multimodal Tasks,
Time Step,
Video Frames,
Action Classes,
Action Localization,
Action Proposals,
Action Recognition,
Additional Modifications,
Alignment Information,
Attention Mechanism,
Attention Module,
Attention Weights

Author's Published Works

Showing 1-4 of 4 results

Conferences (4)

Abhinav Shrivastava (4)
Bo He (4)
Ashish Shah (1)
Jielin Qiu (1)
Le Kang (1)

University of Maryland, College Park (3)
Adobe Research (2)
University of Central Florida (1)
University of Maryland (1)
Baidu Research, USA (1)

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (1)
2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (1)
2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (1)
2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) (1)

IEEE (4)

Media (1)

New Orleans, LA, USA (1)
Seattle, WA, USA (1)
Vancouver, BC, Canada (1)
Waikoloa, HI, USA (1)

Multimodal Tasks (2)
Time Step (2)
Video Frames (2)
Action Classes (1)
Action Localization (1)

Results

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Bo He; Hengduo Li; Young Kyun Jang; Menglin Jia; Xuefei Cao; Ashish Shah; Abhinav Shrivastava; Ser-Nam Lim

2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Year: 2024 | Conference Paper

Cited by: Papers (1)

With the success of large language models (LLMs), integrating the vision model into LLMs to build vision-language foundation models has gained much more interest recently. However, existing LLM-based large multimodal models (e.g., Video-LLaMA, VideoChat) can only take in a limited number of frames for short video understanding. In this study, we mainly focus on designing an efficient and effective...

Content-Aware Image Color Editing with Auxiliary Color Restoration Tasks

Yixuan Ren; Jing Shi; Zhifei Zhang; Yifei Fan; Zhe Lin; Bo He; Abhinav Shrivastava

2024 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Year: 2024 | Conference Paper

Diversified image color editing is typically modeled as a multimodal image-to-image translation (MMI2IT) problem with an impact on multiple applications such as photo enhancement and retouching. Although previous GAN-based algorithms successfully generate diverse edits with clear control, we observe two issues remaining: Firstly, they tend to apply the same color style to all kinds of input images...

Align and Attend: Multimodal Summarization with Dual Contrastive Losses

Bo He; Jun Wang; Jielin Qiu; Trung Bui; Abhinav Shrivastava; Zhaowen Wang

2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Year: 2023 | Conference Paper

Cited by: Papers (26)

The goal of multimodal summarization is to extract the most important information from different modalities to form summaries. Unlike unimodal summarization, the multimodal summarization task explicitly leverages cross-modal information to help generate more reliable and high-quality summaries. However, existing methods fail to leverage the temporal correspondence between different modalities an...

ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization

Bo He; Xitong Yang; Le Kang; Zhiyu Cheng; Xin Zhou; Abhinav Shrivastava

2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Year: 2022 | Conference Paper

Cited by: Papers (68)

Weakly-supervised temporal action localization aims to recognize and localize action segments in untrimmed videos given only video-level action labels for training. Without the boundary information of action segments, existing methods mostly rely on multiple instance learning (MIL), where the predictions of unlabeled instances (i.e., video snippets) are supervised by classifying labeled bags (i.e....

A public charity, IEEE is the world's largest technical professional organization dedicated to advancing technology for the benefit of humanity.

© Copyright 2025 IEEE - All rights reserved, including rights for text and data mining and training of artificial intelligence and similar technologies.