While reading the survey paper "Trends in Integration of Vision and Language Research", I came across the following broad research areas in the NLP and CV domains. I am jotting them down here for future reference.
Natural Language Processing (NLP):
- Tasks fall predominantly into two groups: language understanding and language generation
- Language Understanding
  - Shallow Parsing, Syntax Parsing, Semantic Role Labeling, Named Entity Recognition
  - Entity Linking, Co-reference Resolution
- Language Generation
  - Machine Translation, Text Summarization, Question-Answering
  - Dialog Generation, Storytelling
Computer Vision (CV):
- Tasks fall predominantly into two groups: (i) using image/video as input and (ii) using image/video for representation
- Using as input
  - Image
    - Image Classification, Object Localization, Object Detection
    - Object Segmentation, Object Identification, Instance Segmentation, Panoptic Segmentation
  - Video
    - Action Classification, Object Tracking, Emotion Detection
    - Scene Detection, Automatic Editing
- Image
NLP and CV Tasks Integration:
- Extension of NLP tasks to leverage CV
- Extension of CV tasks to leverage NLP
- Extension of NLP & CV together