Sitemap

A list of all the posts and pages found on the site. For you robots out there is an XML version available for digesting as well.

Page Not Found

Page not found. Your pixels are in another canvas.

Jupyter notebook markdown generator

Posts

Deploy your backend on AWS EC2

3 minute read

Published: August 15, 2021

Some random thoughts (My July rewind)

7 minute read

Published: July 31, 2021

It is 2.02 am right now and my sleepy has not come yet so I think it is a good chance to have some words. The fact that after the 1st covid vaccine dose in the middle of June, I feel much harder to go to sleep, not sure if the vaccine is the main reason or it comes from my other problem. Anyway, with that purpose, today’s blog is not a technical paper review or some algorithms implementation, this post is about my thoughts atm.

[Paper Explained] [AAAI2021] Structured Co-reference Graph Attention for Video-grounded Dialogue

9 minute read

Published: July 14, 2021

Flow up to my brief introduction about video dialogue of the previous blog, today, I will into detail with one of these state-of-the-art approaches in this topic. The paper I wanna introduce is Structured Co-reference Graph Attention for Video-grounded Dialogue, Junyeong et al. published at AAAI 2021. On a high level, the authors proposed a bipartite co-reference structure to connect the information over multiple modalities (visual, linguistic), and then capture information from the complex spatial as well as the temporal dynamics of video via an attention graph. By representing underlying dependencies between modalities, this design has moved 1 step forward in the reasoning over language and visual.

Video dialogue: Introduction

7 minute read

Published: June 09, 2021

Historically, having a system which can discuss as well as interact with you about the football matches/ movies.. with its own knowledge has been considered a very ambitious goal. More than the current AI Visual Model nowsaday, that system must be able to infer video from the past, describe the present, and predict the future. In other words, our system’s capacity must be enough reproduce human intelligent level in video understanding.

Older Blog Posts

less than 1 minute read

Published: January 01, 2020

For the older blogs, please visit my page at Viblo (unfortunately, all was written in Vietnamese). I had written all of those blogs while I had been starting to learn about AI.

portfolio

Sun Asterisk AI Team

Hanoi, Oct 20, 2019

VinAI Engineering Team

Hanoi, Feb 20, 2020

products

AI Fashion Lookup

Published: July 25, 2018

We made an app which automatically finds multiple similar clothing items from the large scaled database. The algorithm paper was published at a machine learning conference.

Self-Driving Car (size 1:7)

Published: May 25, 2019

We made a self-driving car, brought it to the FPT Digital Race 2018-2019 contest, and got the 2nd Prize (2/200 teams).

Suntana: 3D Virtual Personal Assistant

Published: October 24, 2019

We made a lively, visual and practical model of “3D virtual assistant”. Suntana brings a realistic experience to users, it can be personalized and specialized for certain tasks. For example: Welcome interviewees, meeting room booking .

Camera Effect: Bokeh (Portrait Mode)

Published: August 28, 2020

A bokeh effect is used in photography to highlight the most significant parts of an image and blur less important element. Using computer vision technology, we are responsible on making Portrait Mode for Vsmart phone’s Camera.

Todostep

Published: May 31, 2021

A social network for sharing productivity. A tool for spliting your ambitious target into daily tasks.

Miss Your Starbucks

Published: August 08, 2021

This is a project inspired by imissmycafe.com since I miss all of my coffee :( You should put your cup of coffee beside, choose your favorite starbucks view, adjust the environment sound, and start your energy working day!

publications

Large Scale Fashion Search System with Deep Learning and Quantization Indexing

Published in SoICT 2018, 2018

https://dl.acm.org/doi/abs/10.1145/3287921.3287964

Session-Based Recommendation with Self-Attention

Published in SoICT 2019, 2019

https://dl.acm.org/doi/abs/10.1145/3368926.3369682

The Right to Talk: An Audio-Visual Transformer Approach

Published in ICCV 2021, 2021

https://arxiv.org/abs/2108.03256

Video Dialog as Conversation about Objects Living in Space-Time

Published in ECCV 2022, 2022

https://arxiv.org/abs/2207.03656

talks

Vietnam Mobileday 2019: Modern Recommendation Systems

Published: June 14, 2019

This is my first talk in public ever. The topic in about modern recommender systems, why we need RS, how we collect data from user on our platform, how we analyze it and improve our model.

Vietnam Frontier Summit 2019: How can your business benefit from AI?

Published: October 06, 2019

This is the panel discuss where I talked with leaders of the other AI teams in Vietnam about “How can your business benefit from AI?”. Big thanks to Sun Asterisk for giving me this amazing chance.

teaching

Teaching experience 1

Undergraduate course, University 1, Department, 2014

This is a description of a teaching experience. You can use markdown like any other post.

Teaching experience 2

Workshop, University 1, Department, 2015