As technology advances rapidly, text detection and recognition is receiving special attention from researchers. As a result, several real-time applications of video text processing have emerged that require cognitive-based methods. The main applications are (1) retrieving and indexing video based on the semantics of its content, (2) machine translation to assist foreigners, (3) assisting blind people in walking on the road without aid, (4) automatic vehicle driving, (5) license plate tracing to catch vehicles that violate traffic signals, (6) monitoring images posted on social media based on their text and content, (7) identifying a location based on the addresses of streets, shops, etc., (8) tracing players in sports based on jersey/bib numbers or text, and (9) similarly, tracing bib numbers in marathons and other events. For all of these applications, text detection and recognition in video and natural scene images is an integral part of the system.