This is accomplished by systematically increasing the VQA pipeline such as: (1) pre-training with detailed visible and textual element representation (2) successful cross-modal conversation with studying to show up at and (3) A novel knowledge mining framework with specialised pro modules for the intricate VQA job.