
Showing 1–50 of 366 results for author: Khan, Z

  1. arXiv:2409.03028  [pdf, other]

    eess.SY math.OC

    Rigid-Body Attitude Control on $\mathsf{SO(3)}$ using Nonlinear Dynamic Inversion

    Authors: Hafiz Zeeshan Iqbal Khan, Farooq Aslam, Muhammad Farooq Haydar, Jamshed Riaz

    Abstract: This paper presents a cascaded control architecture, based on nonlinear dynamic inversion (NDI), for rigid body attitude control. The proposed controller works directly with the rotation matrix parameterization, that is, with elements of the Special Orthogonal Group $\mathsf{SO(3)}$, and avoids problems related to singularities and non-uniqueness which affect other commonly used attitude represent…

    Submitted 4 September, 2024; originally announced September 2024.

    Comments: 7 pages, 6 figures, accepted in IEEE Conference on Decision and Control (CDC), 2024

  2. arXiv:2409.01184  [pdf, other]

    cs.CV

    PitVis-2023 Challenge: Workflow Recognition in videos of Endoscopic Pituitary Surgery

    Authors: Adrito Das, Danyal Z. Khan, Dimitrios Psychogyios, Yitong Zhang, John G. Hanrahan, Francisco Vasconcelos, You Pang, Zhen Chen, Jinlin Wu, Xiaoyang Zou, Guoyan Zheng, Abdul Qayyum, Moona Mazher, Imran Razzak, Tianbin Li, Jin Ye, Junjun He, Szymon Płotka, Joanna Kaleta, Amine Yamlahi, Antoine Jund, Patrick Godau, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, et al. (7 additional authors not shown)

    Abstract: The field of computer vision applied to videos of minimally invasive surgery is ever-growing. Workflow recognition pertains to the automated recognition of various aspects of a surgery: including which surgical steps are performed; and which surgical instruments are used. This information can later be used to assist clinicians when learning the surgery; during live surgery; and when writing operat…

    Submitted 2 September, 2024; originally announced September 2024.

  3. arXiv:2408.12975  [pdf, other]

    astro-ph.IM

    The UK Submillimetre and Millimetre Astronomy Roadmap 2024

    Authors: K. Pattle, P. S. Barry, A. W. Blain, M. Booth, R. A. Booth, D. L. Clements, M. J. Currie, S. Doyle, D. Eden, G. A. Fuller, M. Griffin, P. G. Huggard, J. D. Ilee, J. Karoly, Z. A. Khan, N. Klimovich, E. Kontar, P. Klaassen, A. J. Rigby, P. Scicluna, S. Serjeant, B. -K. Tan, D. Ward-Thompson, T. G. Williams, T. A. Davis, et al. (9 additional authors not shown)

    Abstract: In this Roadmap, we present a vision for the future of submillimetre and millimetre astronomy in the United Kingdom over the next decade and beyond. This Roadmap has been developed in response to the recommendation of the Astronomy Advisory Panel (AAP) of the STFC in the AAP Astronomy Roadmap 2022. In order to develop our strategic priorities and recommendations, we surveyed the UK submillimetre a…

    Submitted 3 September, 2024; v1 submitted 23 August, 2024; originally announced August 2024.

    Comments: 91 pages plus cover, 38 figures. Submitted to the Science and Technology Facilities Council, August 2024. One figure corrected (v2); new appendix with STFC Q&A; corrected SMA access statement; updated references, acronyms & author list (v3)

  4. arXiv:2408.08454  [pdf, other]

    cs.CV cs.LG

    Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention

    Authors: Zohaib Khan, Muhammad Khaquan, Omer Tafveez, Burhanuddin Samiwala, Agha Ali Raza

    Abstract: The Transformer architecture has revolutionized deep learning through its Self-Attention mechanism, which effectively captures contextual information. However, the memory footprint of Self-Attention presents significant challenges for long-sequence tasks. Grouped Query Attention (GQA) addresses this issue by grouping queries and mean-pooling the corresponding key-value heads - reducing the number…

    Submitted 28 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: 11 pages, 9 figures

  5. arXiv:2408.08372  [pdf, other]

    hep-th math.QA math.SG

    On the Algebra of the Infrared with Twisted Masses

    Authors: Ahsan Z. Khan, Gregory W. Moore

    Abstract: The Algebra of the Infrared \cite{Gaiotto:2015aoa} is a framework to construct local observables, interfaces, and categories of supersymmetric boundary conditions of massive $\mathcal{N}=(2,2)$ theories in two dimensions by using information only about the BPS sector. The resulting framework is known as the ``web-based formalism.'' In this paper we initiate the generalization of the web-based form…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 112 pages

  6. arXiv:2406.10889  [pdf, other]

    cs.CV cs.AI cs.LG

    VELOCITI: Can Video-Language Models Bind Semantic Concepts through Time?

    Authors: Darshana Saravanan, Darshan Singh, Varun Gupta, Zeeshan Khan, Vineet Gandhi, Makarand Tapaswi

    Abstract: Compositionality is a fundamental aspect of vision-language understanding and is especially required for videos since they contain multiple entities (e.g. persons, actions, and scenes) interacting dynamically over time. Existing benchmarks focus primarily on perception capabilities. However, they do not study binding, the ability of a model to associate entities through appropriate relationships.…

    Submitted 16 June, 2024; originally announced June 2024.

    Comments: 26 pages, 17 figures, 3 tables

  7. arXiv:2405.17788  [pdf, other]

    cs.CV

    Enhancing Road Safety: Real-Time Detection of Driver Distraction through Convolutional Neural Networks

    Authors: Amaan Aijaz Sheikh, Imaad Zaffar Khan

    Abstract: As we navigate our daily commutes, the threat posed by a distracted driver is at large, resulting in a troubling rise in traffic accidents. Addressing this safety concern, our project harnesses the analytical power of Convolutional Neural Networks (CNNs), with a particular emphasis on the well-established models VGG16 and VGG19. These models are acclaimed for their precision in image recognition…

    Submitted 27 May, 2024; originally announced May 2024.

  8. arXiv:2405.13949  [pdf, other]

    cs.CV

    PitVQA: Image-grounded Text Embedding LLM for Visual Question Answering in Pituitary Surgery

    Authors: Runlong He, Mengya Xu, Adrito Das, Danyal Z. Khan, Sophia Bano, Hani J. Marcus, Danail Stoyanov, Matthew J. Clarkson, Mobarakol Islam

    Abstract: Visual Question Answering (VQA) within the surgical domain, utilizing Large Language Models (LLMs), offers a distinct opportunity to improve intra-operative decision-making and facilitate intuitive surgeon-AI interaction. However, the development of LLMs for surgical VQA is hindered by the scarcity of diverse and extensive datasets with complex reasoning tasks. Moreover, contextual fusion of the i…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 10 pages, 3 figures

  9. arXiv:2405.11483  [pdf, other]

    cs.CV

    MICap: A Unified Model for Identity-aware Movie Descriptions

    Authors: Haran Raajesh, Naveen Reddy Desanur, Zeeshan Khan, Makarand Tapaswi

    Abstract: Characters are an important aspect of any storyline and identifying and including them in descriptions is necessary for story understanding. While previous work has largely ignored identity and generated captions with someone (anonymized names), recent work formulates id-aware captioning as a fill-in-the-blanks (FITB) task, where, given a caption with blanks, the goal is to predict person id label…

    Submitted 19 May, 2024; originally announced May 2024.

    Comments: CVPR 2024, Project Page: https://katha-ai.github.io/projects/micap/

  10. arXiv:2405.09361  [pdf, other]

    hep-th cond-mat.str-el quant-ph

    Quantum operations for Kramers-Wannier duality

    Authors: Maaz Khan, Syed Anausha Bin Zakir Khan, Arif Mohd

    Abstract: We study the Kramers-Wannier duality for the transverse-field Ising lattice on a ring. A careful consideration of the ring boundary conditions shows that the duality has to be implemented with a proper treatment of different charge sectors of both the twisted and untwisted Ising and the dual-Ising Hilbert spaces. We construct a superoperator that explicitly maps the Ising operators to the dual-Isi…

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 7 pages, 1 table, 4 figures

  11. arXiv:2405.02926  [pdf, ps, other]

    nucl-th

    Study of charge changing and interaction cross sections for $4 \leq Z \leq 9$ isotopes

    Authors: M. Imran, Z. Hasan, A. A. Usmani, Z. A. Khan

    Abstract: The root-mean-square proton and neutron radii for $^{7,9-12,14}$Be, $^{10-15,17}$B, $^{12-19}$C, $^{14,15,17-22}$N, $^{16,18-24}$O, and $^{18-21,23-26}$F isotopes are deduced from a systematic analysis of experimental charge changing and interaction cross sections in the framework of the Glauber model. The calculations involve descriptions of nuclei based on Slater determinants…

    Submitted 5 May, 2024; originally announced May 2024.

  12. arXiv:2405.02907  [pdf]

    cond-mat.mtrl-sci

    Nanostructured BiVO4 Photoanodes Fabricated by Vanadium-infused Interaction for Efficient Solar Water Splitting

    Authors: Amar K. Salih, Abdul Zeeshan Khan, Qasem A. Drmosh, Tarek A. Kandiel, Mohammad Qamar, Tahir Naveed Jahangir, Cuong Ton-That, Zain H. Yamani

    Abstract: Bismuth vanadate (BiVO4) has emerged as a highly prospective material for photoanodes in photoelectrochemical (PEC) water oxidation. However, current limitations with this material lie in the difficulties in producing stable and continuous BiVO4 layers with efficient carrier transfer kinetics, thereby impeding its widespread application in water splitting processes. This study introduces a new fab…

    Submitted 5 May, 2024; originally announced May 2024.

  13. arXiv:2404.10193  [pdf, other]

    cs.CV

    Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering

    Authors: Zaid Khan, Yun Fu

    Abstract: The goal of selective prediction is to allow a model to abstain when it may not be able to deliver a reliable prediction, which is important in safety-critical contexts. Existing approaches to selective prediction typically require access to the internals of a model, require retraining a model or study only unimodal models. However, the most powerful models (e.g. GPT-4) are typically only avail…

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  14. arXiv:2404.04627  [pdf, other]

    cs.CV

    Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement

    Authors: Zaid Khan, Vijay Kumar BG, Samuel Schulter, Yun Fu, Manmohan Chandraker

    Abstract: Visual program synthesis is a promising approach to exploit the reasoning abilities of large language models for compositional computer vision tasks. Previous work has used few-shot prompting with frozen LLMs to synthesize visual programs. Training an LLM to write better visual programs is an attractive prospect, but it is unclear how to accomplish this. No dataset of visual programs for training…

    Submitted 6 April, 2024; originally announced April 2024.

    Comments: CVPR 2024

  15. arXiv:2403.09715  [pdf, other]

    cs.SE cs.CL cs.CR cs.LG

    Textual analysis of End User License Agreement for red-flagging potentially malicious software

    Authors: Behraj Khan, Tahir Syed, Zeshan Khan, Muhammad Rafi

    Abstract: New software and updates are downloaded by end users every day. Each downloaded software has associated with it an End User License Agreement (EULA), but this is rarely read. An EULA includes information to avoid legal repercussions. However, this poses a host of potential problems such as spyware or producing an unwanted effect in the target system. End users do not read these EULAs because o…

    Submitted 11 March, 2024; originally announced March 2024.

  16. arXiv:2402.05126  [pdf, other]

    cs.CL cs.LG

    Graph Neural Network and NER-Based Text Summarization

    Authors: Imaad Zaffar Khan, Amaan Aijaz Sheikh, Utkarsh Sinha

    Abstract: With the abundance of data and information in today's time, it is nearly impossible for man, or even machine, to go through all of the data line by line. What one usually does is to try to skim through the lines and retain the absolutely important information, that in a more formal term is called summarization. Text summarization is an important task that aims to compress lengthy documents or arti…

    Submitted 4 February, 2024; originally announced February 2024.

  17. arXiv:2401.12728  [pdf, other]

    astro-ph.SR astro-ph.GA

    Filamentary Network and Magnetic Field Structures Revealed with BISTRO in the High-Mass Star-Forming Region NGC2264 : Global Properties and Local Magnetogravitational Configurations

    Authors: Jia-Wei Wang, Patrick M. Koch, Seamus D. Clarke, Gary Fuller, Nicolas Peretto, Ya-Wen Tang, Hsi-Wei Yen, Shih-Ping Lai, Nagayoshi Ohashi, Doris Arzoumanian, Doug Johnstone, Ray Furuya, Shu-ichiro Inutsuka, Chang Won Lee, Derek Ward-Thompson, Valentin J. M. Le Gouellec, Hong-Li Liu, Lapo Fanciullo, Jihye Hwang, Kate Pattle, Frédérick Poidevin, Mehrnoosh Tahani, Takashi Onaka, Mark G. Rawlings, Eun Jung Chung, et al. (132 additional authors not shown)

    Abstract: We report 850 $μ$m continuum polarization observations toward the filamentary high-mass star-forming region NGC 2264, taken as part of the B-fields In STar forming Regions Observations (BISTRO) large program on the James Clerk Maxwell Telescope (JCMT). These data reveal a well-structured non-uniform magnetic field in the NGC 2264C and 2264D regions with a prevailing orientation around 30 deg from…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: Accepted for publication in the Astrophysical Journal. 43 pages, 32 figures, and 4 tables (including Appendix)

  18. arXiv:2401.12667  [pdf, ps, other]

    stat.ML cs.LG

    Feature Selection via Robust Weighted Score for High Dimensional Binary Class-Imbalanced Gene Expression Data

    Authors: Zardad Khan, Amjad Ali, Saeed Aldahmani

    Abstract: In this paper, a robust weighted score for unbalanced data (ROWSU) is proposed for selecting the most discriminative feature for high dimensional gene expression binary classification with class-imbalance problem. The method addresses one of the most challenging problems of highly skewed class distributions in gene expression datasets that adversely affect the performance of classification algorit…

    Submitted 23 January, 2024; originally announced January 2024.

    Comments: 25 pages

    MSC Class: 14J60

  19. arXiv:2401.07669  [pdf, other]

    cs.CV

    FiGCLIP: Fine-Grained CLIP Adaptation via Densely Annotated Videos

    Authors: Darshan Singh S, Zeeshan Khan, Makarand Tapaswi

    Abstract: While contrastive language image pretraining (CLIP) has exhibited impressive performance by learning highly semantic and generalized representations, recent works have exposed a fundamental drawback in its syntactic properties, that includes interpreting fine-grained attributes, actions, spatial relations, states, and details that require compositional reasoning. One reason for this is that natur…

    Submitted 15 January, 2024; originally announced January 2024.

  20. arXiv:2311.09762  [pdf, other]

    cs.CL cs.AI cs.LG

    Graph Elicitation for Guiding Multi-Step Reasoning in Large Language Models

    Authors: Jinyoung Park, Ameen Patel, Omar Zia Khan, Hyunwoo J. Kim, Joo-Kyung Kim

    Abstract: Chain-of-Thought (CoT) prompting along with sub-question generation and answering has enhanced multi-step reasoning capabilities of Large Language Models (LLMs). However, prompting the LLMs to directly generate sub-questions is suboptimal since they sometimes generate redundant or irrelevant questions. To deal with them, we propose a GE-Reasoning method, which directs LLMs to generate proper sub-q…

    Submitted 22 June, 2024; v1 submitted 16 November, 2023; originally announced November 2023.

    Comments: Preprint

  21. arXiv:2310.20081  [pdf, other]

    cs.CL cs.AI cs.IR

    Integrating Summarization and Retrieval for Enhanced Personalization via Large Language Models

    Authors: Chris Richardson, Yao Zhang, Kellen Gillespie, Sudipta Kar, Arshdeep Singh, Zeynab Raeesy, Omar Zia Khan, Abhinav Sethy

    Abstract: Personalization, the ability to tailor a system to individual users, is an essential factor in user experience with natural language processing (NLP) systems. With the emergence of Large Language Models (LLMs), a key question is how to leverage these models to better personalize user experiences. To personalize a language model's output, a straightforward approach is to incorporate past user data…

    Submitted 30 October, 2023; originally announced October 2023.

    Comments: 4 pages, International Workshop on Personalized Generative AI (@CIKM 2023)

    ACM Class: I.2.7; H.3.3

  22. arXiv:2310.17954  [pdf, other]

    eess.IV cs.CV

    Multivessel Coronary Artery Segmentation and Stenosis Localisation using Ensemble Learning

    Authors: Muhammad Bilal, Dinis Martinho, Reiner Sim, Adnan Qayyum, Hunaid Vohra, Massimo Caputo, Taofeek Akinosho, Sofiat Abioye, Zaheer Khan, Waleed Niaz, Junaid Qadir

    Abstract: Coronary angiography analysis is a common clinical task performed by cardiologists to diagnose coronary artery disease (CAD) through an assessment of atherosclerotic plaque's accumulation. This study introduces an end-to-end machine learning solution developed as part of our solution for the MICCAI 2023 Automatic Region-based Coronary Artery Disease diagnostics using x-ray angiography imagEs (ARCA…

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: Submission report for ARCADE challenge hosted at MICCAI2023

  23. arXiv:2310.17050  [pdf, other]

    cs.CV

    Exploring Question Decomposition for Zero-Shot VQA

    Authors: Zaid Khan, Vijay Kumar BG, Samuel Schulter, Manmohan Chandraker, Yun Fu

    Abstract: Visual question answering (VQA) has traditionally been treated as a single-step task where each question receives the same amount of effort, unlike natural human question-answering strategies. We explore a question decomposition strategy for VQA to overcome this limitation. We probe the ability of recently developed large vision-language models to use human-written decompositions and produce their…

    Submitted 25 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023 Camera Ready

  24. arXiv:2310.17032  [pdf, other]

    quant-ph cs.LG

    Quantum Long Short-Term Memory (QLSTM) vs Classical LSTM in Time Series Forecasting: A Comparative Study in Solar Power Forecasting

    Authors: Saad Zafar Khan, Nazeefa Muzammil, Salman Ghafoor, Haibat Khan, Syed Mohammad Hasan Zaidi, Abdulah Jeza Aljohani, Imran Aziz

    Abstract: Accurate solar power forecasting is pivotal for the global transition towards sustainable energy systems. This study conducts a meticulous comparison between Quantum Long Short-Term Memory (QLSTM) and classical Long Short-Term Memory (LSTM) models for solar power production forecasting. The primary objective is to evaluate the potential advantages of QLSTMs, leveraging their exponential representa…

    Submitted 9 April, 2024; v1 submitted 25 October, 2023; originally announced October 2023.

    Comments: 33 pages, 9 figures

  25. arXiv:2309.13033  [pdf, ps, other]

    eess.SY

    Robust Stability Analysis of a Class of LTV Systems

    Authors: Shahzad Ahmed, Hafiz Zeeshan Iqbal Khan, Jamshed Riaz

    Abstract: Many physical systems are inherently time-varying in nature. When these systems are linearized around a trajectory, generally, the resulting system is Linear Time-Varying (LTV). LTV systems describe an important class of linear systems and can be thought of as a natural extension of LTI systems. However, it is well known that, unlike LTI systems, the eigenvalues of an LTV system do not determine i…

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: Presented at 20th International Bhurban Conference on Applied Sciences and Technology (IBCAST), 2023

  26. arXiv:2309.13032  [pdf, ps, other]

    eess.SY

    Modelling, Simulation, and Control of a Flexible Space Launch Vehicle

    Authors: Muhammad Abdullah Aamer, Qurat Ul Ain, Ushbah Kaleem, Hafiz Zeeshan Iqbal Khan, Jamshed Riaz

    Abstract: Modern Space Launch Vehicles (SLVs), being slender in shape and due to the use of lightweight materials, are generally flexible in nature. This structural flexibility, when coupled with sensor and actuator dynamics, can adversely affect the control of SLV, which may lead to vehicle instability and, in the worst-case scenario, to structural failure. This work focuses on modelling and simulation of…

    Submitted 22 September, 2023; originally announced September 2023.

    Comments: Presented at 20th International Bhurban Conference on Applied Sciences and Technology (IBCAST), 2023

  27. arXiv:2309.10774  [pdf, ps, other]

    eess.SY math.OC

    Inverse Optimal Control Design of a VTOL Aircraft

    Authors: Kinza Rehman, Hafiz Zeeshan Iqbal Khan, Muhammad Farooq Haydar

    Abstract: A Vertical Takeoff and Landing (VTOL) aircraft is capable of short/vertical takeoffs and landings, thus eliminating the requirement of long runways. This feature makes them suitable for a broader variety of missions, generally not achievable by a traditional aircraft. The dynamics of the VTOL aircraft in the vertical plane has been used as a benchmark problem to demonstrate the effectiveness of di…

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: Presented at 20th International Bhurban Conference on Applied Sciences and Technology (IBCAST), 2023

  28. arXiv:2308.15827  [pdf, other]

    cs.CV

    Introducing Language Guidance in Prompt-based Continual Learning

    Authors: Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Didier Stricker, Federico Tombari, Muhammad Zeshan Afzal

    Abstract: Continual Learning aims to learn a single model on a sequence of tasks without having access to data from previous tasks. The biggest challenge in the domain still remains catastrophic forgetting: a loss in performance on seen classes of earlier tasks. Some existing methods rely on an expensive replay buffer to store a chunk of data from previous tasks. This, while promising, becomes expensive whe…

    Submitted 30 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV 2023

  29. arXiv:2307.16262  [pdf, other]

    eess.IV cs.CV

    Validating polyp and instrument segmentation methods in colonoscopy through Medico 2020 and MedAI 2021 Challenges

    Authors: Debesh Jha, Vanshali Sharma, Debapriya Banik, Debayan Bhattacharya, Kaushiki Roy, Steven A. Hicks, Nikhil Kumar Tomar, Vajira Thambawita, Adrian Krenzer, Ge-Peng Ji, Sahadev Poudel, George Batchkala, Saruar Alam, Awadelrahman M. A. Ahmed, Quoc-Huy Trinh, Zeshan Khan, Tien-Phat Nguyen, Shruti Shrestha, Sabari Nathan, Jeonghwan Gwak, Ritika K. Jha, Zheyuan Zhang, Alexander Schlaefer, Debotosh Bhattacharjee, M. K. Bhuyan, et al. (8 additional authors not shown)

    Abstract: Automatic analysis of colonoscopy images has been an active field of research motivated by the importance of early detection of precancerous polyps. However, detecting polyps during the live examination can be challenging due to various factors such as variation of skills and experience among the endoscopists, lack of attentiveness, and fatigue leading to a high polyp miss-rate. Deep learning has…

    Submitted 6 May, 2024; v1 submitted 30 July, 2023; originally announced July 2023.

  30. arXiv:2306.03932  [pdf, other]

    cs.CV

    Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!

    Authors: Zaid Khan, Vijay Kumar BG, Samuel Schulter, Xiang Yu, Yun Fu, Manmohan Chandraker

    Abstract: Finetuning a large vision language model (VLM) on a target dataset after large scale pretraining is a dominant paradigm in visual question answering (VQA). Datasets for specialized tasks such as knowledge-based VQA or VQA in non natural-image domains are orders of magnitude smaller than those for general-purpose VQA. While collecting additional labels for specialized tasks or domains can be challe…

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: CVPR 2023

  31. arXiv:2306.01819  [pdf]

    cs.PL cs.AI

    Comparative Analysis of Widely use Object-Oriented Languages

    Authors: Muhammad Shoaib Farooq, Taymour zaman Khan

    Abstract: Programming is an integral part of the computer science discipline. Every day the programming environment is not only rapidly growing but also changing and languages are constantly evolving. Learning the object-oriented paradigm is compulsory in every computer science major so the choice of language to teach object-oriented principles is very important. Due to the large pool of object-oriented languages, i…

    Submitted 2 June, 2023; originally announced June 2023.

    Comments: 30 pages, 2 figures

  32. Impact of Log Parsing on Deep Learning-Based Anomaly Detection

    Authors: Zanis Ali Khan, Donghwan Shin, Domenico Bianculli, Lionel Briand

    Abstract: Software systems log massive amounts of data, recording important runtime information. Such logs are used, for example, for log-based anomaly detection, which aims to automatically detect abnormal behaviors of the system under analysis by processing the information recorded in its logs. Many log-based anomaly detection techniques based on deep learning models include a pre-processing step called l…

    Submitted 19 August, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Journal ref: Empir Software Eng 29, 139 (2024)

  33. arXiv:2305.06934  [pdf, other]

    cs.SE cs.AI cs.CL cs.CY cs.LG cs.PL

    Humans are Still Better than ChatGPT: Case of the IEEEXtreme Competition

    Authors: Anis Koubaa, Basit Qureshi, Adel Ammar, Zahid Khan, Wadii Boulila, Lahouari Ghouti

    Abstract: Since the release of ChatGPT, numerous studies have highlighted the remarkable performance of ChatGPT, which often rivals or even surpasses human capabilities in various tasks and domains. However, this paper presents a contrasting perspective by demonstrating an instance where human performance excels in typical tasks suited for ChatGPT, specifically in the domain of computer programming. We util…

    Submitted 10 May, 2023; originally announced May 2023.

    Comments: 9 pages, 3 figures

  34. arXiv:2304.09756  [pdf, other]

    cs.LG cs.AI cs.HC eess.SP

    Contactless Human Activity Recognition using Deep Learning with Flexible and Scalable Software Define Radio

    Authors: Muhammad Zakir Khan, Jawad Ahmad, Wadii Boulila, Matthew Broadbent, Syed Aziz Shah, Anis Koubaa, Qammer H. Abbasi

    Abstract: Ambient computing is gaining popularity as a major technological advancement for the future. The modern era has witnessed a surge in the advancement in healthcare systems, with viable radio frequency solutions proposed for remote and unobtrusive human activity recognition (HAR). Specifically, this study investigates the use of Wi-Fi channel state information (CSI) as a novel method of ambient sens…

    Submitted 18 April, 2023; originally announced April 2023.

  35. arXiv:2304.06533  [pdf]

    cond-mat.mtrl-sci

    Probing magnetic ordering in air stable iron-rich van der Waals minerals

    Authors: Muhammad Zubair Khan, Oleg E. Peil, Apoorva Sharma, Oleksandr Selyshchev, Sergio Valencia, Florian Kronast, Maik Zimmermann, Muhammad Awais Aslam, Johann G. Raith, Christian Teichert, Dietrich R. T. Zahn, Georgeta Salvan, Aleksandar Matković, Chair of Physics, Department Physics, Mechanics, Electrical engineering, Montanuniversität Leoben, 8700, Leoben, Austria., Materials Center Leoben Forschung GmbH, 8700, Leoben, Austria. , et al. (24 additional authors not shown)

    Abstract: In the rapidly expanding field of two-dimensional materials, magnetic monolayers show great promise for the future applications in nanoelectronics, data storage, and sensing. The research in intrinsically magnetic two-dimensional materials mainly focuses on synthetic iodide and telluride based compounds, which inherently suffer from the lack of ambient stability. So far, naturally occurring layere…

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: 19 pages, 6 figures

  36. arXiv:2304.04161  [pdf]

    eess.IV cs.CV

    Detection of COVID19 in Chest X-Ray Images Using Transfer Learning

    Authors: Zanoby N. Khan

    Abstract: COVID19 is a highly contagious disease that has infected millions of people worldwide. With limited testing components, screening tools such as chest radiography can assist clinicians in the diagnosis and in assessing the progress of the disease. The performance of deep learning-based systems for diagnosis of COVID-19 disease in radiograph images has been encouraging. This paper investigates the concept of tr…

    Submitted 9 April, 2023; originally announced April 2023.

  37. A Low-Complexity Diversity-Preserving Universal Bit-Flipping Enhanced Hard Decision Decoder for Arbitrary Linear Codes

    Authors: Praveen Sai Bere, Mohammed Zafar Ali Khan, Lajos Hanzo

    Abstract: V2X (Vehicle-to-everything) communication relies on short messages for short-range transmissions over a fading wireless channel, yet requires high reliability and low latency. Hard-decision decoding sacrifices the preservation of diversity order, leading to pronounced performance degradation in fading channels. By contrast, soft-decision decoding retains diversity order, albeit at the cost of in…

    Submitted 7 August, 2024; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Journal of 23 pages

  38. arXiv:2303.12210  [pdf, ps, other]

    stat.ML cs.LG

    A Random Projection k Nearest Neighbours Ensemble for Classification via Extended Neighbourhood Rule

    Authors: Amjad Ali, Muhammad Hamraz, Dost Muhammad Khan, Wajdan Deebani, Zardad Khan

    Abstract: Ensembles based on k nearest neighbours (kNN) combine a large number of base learners, each constructed on a sample taken from a given training data. Typical kNN based ensembles determine the k closest observations in the training data bounded to a test sample point by a spherical region to predict its class. In this paper, a novel random projection extended neighbourhood rule (RPExNRule) ensemble…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: 23 pages, 8 diagrams, 69 references

    ACM Class: F.2.2

  39. arXiv:2303.11866  [pdf, other]

    cs.CV

    Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning

    Authors: Zaid Khan, Yun Fu

    Abstract: Contrastive vision-language models (e.g. CLIP) are typically created by updating all the parameters of a vision model and language model through contrastive training. Can such models be created by a small number of parameter updates to an already-trained language model and vision model? The literature describes techniques that can create vision-language models by updating a small number of paramet…

    Submitted 21 March, 2023; originally announced March 2023.

    Comments: Accepted to ICLR 2023

  40. TAU: A Framework for Video-Based Traffic Analytics Leveraging Artificial Intelligence and Unmanned Aerial Systems

    Authors: Bilel Benjdira, Anis Koubaa, Ahmad Taher Azar, Zahid Khan, Adel Ammar, Wadii Boulila

    Abstract: Smart traffic engineering and intelligent transportation services are in increasing demand from governmental authorities to optimize traffic performance and thus reduce energy costs, increase the drivers' safety and comfort, ensure traffic laws enforcement, and detect traffic violations. In this paper, we address this challenge, and we leverage the use of Artificial Intelligence (AI) and Unmanned…

    Submitted 1 March, 2023; originally announced March 2023.

    Comments: This is the final proofread version submitted to Elsevier EAAI: please see the published version at: https://doi.org/10.1016/j.engappai.2022.105095

    Journal ref: Engineering Applications of Artificial Intelligence, Volume 114, 2022, 105095, ISSN 0952-1976

  41. arXiv:2302.10978  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    Learning to Retrieve Engaging Follow-Up Queries

    Authors: Christopher Richardson, Sudipta Kar, Anjishnu Kumar, Anand Ramachandran, Omar Zia Khan, Zeynab Raeesy, Abhinav Sethy

    Abstract: Open domain conversational agents can answer a broad range of targeted queries. However, the sequential nature of interaction with these systems makes knowledge exploration a lengthy task which burdens the user with asking a chain of well phrased questions. In this paper, we present a retrieval based system and associated dataset for predicting the next questions that the user might have. Such a s…

    Submitted 21 February, 2023; originally announced February 2023.

    Comments: EACL 2023

  42. arXiv:2301.04115  [pdf, other

    eess.SP

    Sensing the Environment with 5G Scattered Signals (5G-CommSense): A Feasibility Analysis

    Authors: Sandip Jana, Amit Kumar Mishra, Mohammed Zafar Ali Khan

    Abstract: By making use of sensors and AI (SensAI) algorithms for a specialized task, the Application Specific INstrumentation (ASIN) framework uses less computational overhead and gives good performance. This work evaluates the feasibility of the ASIN framework dependent Communication based Sensing (CommSense) system using 5th Generation New Radio (5G NR) infrastructure. Since our proposed system is back…

    Submitted 10 January, 2023; originally announced January 2023.

    Comments: 3 pages, Accepted in conference

  43. arXiv:2301.00343  [pdf, ps, other

    hep-th math.SG

    On the $A_{\infty}$-Category of a Holomorphic Moment Map

    Authors: Ahsan Z. Khan

    Abstract: Let $M$ be a hyperKähler manifold equipped with a $U(1)$ hyperKähler isometry, and let $I$ be a complex structure on $M$. In this note, we study the $A_{\infty}$-category of A-branes for the Landau-Ginzburg model with target space $(M,I)$, and superpotential being the $I$-holomorphic moment map. We show that if $I$ is a generic complex structure, the $A_{\infty}$-category is semi-simple. For excep…

    Submitted 31 December, 2022; originally announced January 2023.

    Comments: 13 pages

  44. arXiv:2212.02291  [pdf, other

    cs.CV

    I2MVFormer: Large Language Model Generated Multi-View Document Supervision for Zero-Shot Image Classification

    Authors: Muhammad Ferjad Naeem, Muhammad Gul Zain Ali Khan, Yongqin Xian, Muhammad Zeshan Afzal, Didier Stricker, Luc Van Gool, Federico Tombari

    Abstract: Recent works have shown that unstructured text (documents) from online sources can serve as useful auxiliary information for zero-shot image classification. However, these methods require access to a high-quality source like Wikipedia and are limited to a single source of information. Large Language Models (LLM) trained on web-scale text show impressive abilities to repurpose their learned knowled…

    Submitted 5 December, 2022; originally announced December 2022.

  45. arXiv:2211.11278  [pdf, ps, other

    stat.ML cs.LG

    Optimal Extended Neighbourhood Rule $k$ Nearest Neighbours Ensemble

    Authors: Amjad Ali, Zardad Khan, Dost Muhammad Khan, Saeed Aldahmani

    Abstract: The traditional k nearest neighbor (kNN) approach uses a distance formula within a spherical region to determine the k closest training observations to a test sample point. However, this approach may not work well when the test point is located outside this region. Moreover, aggregating many base kNN learners can result in poor ensemble performance due to high classification errors. To address these i…

    Submitted 15 February, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: This manuscript has been submitted for publication in the esteemed journal Pattern Recognition Letters

    MSC Class: 14J60

  46. arXiv:2210.11557  [pdf, other

    cs.CV

    Learning Attention Propagation for Compositional Zero-Shot Learning

    Authors: Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal

    Abstract: Compositional zero-shot learning aims to recognize unseen compositions of seen visual primitives of object classes and their states. While all primitives (states and objects) are observable during training in some combination, their complex interaction makes this task especially hard. For example, wet changes the visual appearance of a dog very differently from a bicycle. Furthermore, we argue tha…

    Submitted 20 October, 2022; originally announced October 2022.

  47. arXiv:2210.10828  [pdf, other

    cs.CV

    Grounded Video Situation Recognition

    Authors: Zeeshan Khan, C. V. Jawahar, Makarand Tapaswi

    Abstract: Dense video understanding requires answering several questions such as who is doing what to whom, with what, how, why, and where. Recently, Video Situation Recognition (VidSitu) is framed as a task for structured prediction of multiple events, their relationships, and actions and various verb-role pairs attached to descriptive entities. This task poses several challenges in identifying, disambigua…

    Submitted 19 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022. Project Page: https://zeeshank95.github.io/grvidsitu

  48. arXiv:2210.04429  [pdf, other

    eess.IV cs.CV

    DeepHS-HDRVideo: Deep High Speed High Dynamic Range Video Reconstruction

    Authors: Zeeshan Khan, Parth Shettiwar, Mukul Khanna, Shanmuganathan Raman

    Abstract: Due to hardware constraints, standard off-the-shelf digital cameras suffer from low dynamic range (LDR) and low frames per second (FPS) outputs. Previous works in high dynamic range (HDR) video reconstruction use a sequence of alternating exposure LDR frames as input, and align the neighbouring frames using optical flow based networks. However, these methods often result in motion artifacts in chal…

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: ICPR 2022

  49. Gastrointestinal Disorder Detection with a Transformer Based Approach

    Authors: A. K. M. Salman Hosain, Mynul islam, Md Humaion Kabir Mehedi, Irteza Enan Kabir, Zarin Tasnim Khan

    Abstract: Accurate disease categorization using endoscopic images is a significant problem in Gastroenterology. This paper describes a technique for assisting medical diagnosis procedures and identifying gastrointestinal tract disorders based on the categorization of characteristics taken from endoscopic pictures using a vision transformer and transfer learning model. Vision transformer has shown very promi…

    Submitted 6 October, 2022; originally announced October 2022.

  50. arXiv:2209.07387  [pdf, other

    hep-th

    Holomorphic Surface Defects in Four-Dimensional Chern-Simons Theory

    Authors: Ahsan Z. Khan

    Abstract: We derive the framing anomaly of four-dimensional holomorphic-topological Chern-Simons theory formulated on the product of a topological surface and the complex plane. We show that the presence of this anomaly allows one to couple four-dimensional Chern-Simons theory to holomorphic field theories with Kac-Moody symmetry, where the Kac-Moody level $k$ is critical $k=-h^{\vee}$. Applying this result…

    Submitted 15 September, 2022; originally announced September 2022.

    Comments: 41 pages