Optimizing ChatGPT’s image analysis

Artificial intelligence like ChatGPT has become a powerful tool in image analysis, but its performance often declines when dealing with low resolution, reduced color depth, blur, or noise. This project addresses these challenges through three key objectives which is ChatGPT capability evaluation,...

Full description

Saved in:

Bibliographic Details
Main Author:	Chong, Michi
Format:	Final Year Project / Dissertation / Thesis
Published:	2025
Subjects:	T Technology (General)
Online Access:	http://eprints.utar.edu.my/7136/1/fyp_CS_2025_CM.pdf http://eprints.utar.edu.my/7136/
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1854094481597399040
author	Chong, Michi
author_facet	Chong, Michi
author_sort	Chong, Michi
building	UTAR Library
collection	Institutional Repository
content_provider	Universiti Tunku Abdul Rahman
content_source	UTAR Institutional Repository
continent	Asia
country	Malaysia
description	Artificial intelligence like ChatGPT has become a powerful tool in image analysis, but its performance often declines when dealing with low resolution, reduced color depth, blur, or noise. This project addresses these challenges through three key objectives which is ChatGPT capability evaluation, ChatGPT preprocessing optimization, and image quality restore module implementation. Together, these steps aim to establish clear performance limits for ChatGPT, extend its capability through restoration techniques, and design a robust pipeline for real-world applications. Firstly, ChatGPT capability evaluation was conducted to determine ChatGPT’s thresholds for reliable image analysis. Systematic testing revealed that a minimum resolution of 512px with 24-bit RGB color depth provided the most consistent balance between accuracy and efficiency. Performance dropped sharply when resolution was reduced to 256px or lower, while higher distortion levels of blur and noise above 15% significantly impaired accuracy. Further analysis showed that bar and line charts were more vulnerable to distortion than pie charts, highlighting differences in sensitivity across visualization types. These experiments established precise thresholds for resolution, color depth, blur, and noise, providing a baseline for effective and consistent analysis. Furthermore, the model preprocessing optimization and image quality restoration methods were explored to optimize good inputs and restore degraded inputs. For images already meeting the thresholds, optimization was performed to reduce computational load. The optimization is using an HSV-based background removal technique, where a saturation threshold of 15 effectively reduced file size while maintaining accuracy. For degraded inputs, an image restoration module was developed. Comparative testing demonstrated that PixelCut AI consistently outperformed DeepImage AI in deblurring, while a trained DnCNN model exceeded morphology-based approaches in denoising, particularly under high noise levels. These findings confirmed that advanced restoration techniques can extend ChatGPT’s capacity to analyse images that would otherwise fall below acceptable quality levels. Moreover, to optimize or restore the image, the implementation was realized through an integrated image processing pipeline designed to balance efficiency with reliability. The pipeline begins with quality evaluation to assess resolution, color depth, blur, and noise. Good-quality images are optimized to reduce computational load, while degraded inputs undergo restoration through resizing, deblurring, or denoising until they meet the minimum thresholds. A validation stage then ensures all processed images satisfy the required standards before being analysed by ChatGPT. This structure allows the system to optimize clear images while reliably enhancing poor-quality inputs, ensuring consistent results across varied conditions. The project culminated in the development of the ChatGPT AI Vision Assistant, implemented in Streamlit, which supports both text and image queries while integrating the image processing pipeline and a replot function for dynamic chart visualization. This system enables users to test quality thresholds, experience optimized analysis and observe the benefits of restoration methods. Overall, the project defines precise thresholds for resolution, color depth, blur, and noise, enhances degraded images with optimized and trained restoration models, and implements a complete pipeline that balances efficiency with reliability. Together, these outcomes deliver a robust framework that significantly strengthens ChatGPT’s reliability in real-world image analysis tasks.
format	Final Year Project / Dissertation / Thesis
id	my-utar-eprints.7136
institution	Universiti Tunku Abdul Rahman
publishDate	2025
record_format	eprints
spelling	my-utar-eprints.71362025-12-28T16:05:24Z Optimizing ChatGPT’s image analysis Chong, Michi T Technology (General) Artificial intelligence like ChatGPT has become a powerful tool in image analysis, but its performance often declines when dealing with low resolution, reduced color depth, blur, or noise. This project addresses these challenges through three key objectives which is ChatGPT capability evaluation, ChatGPT preprocessing optimization, and image quality restore module implementation. Together, these steps aim to establish clear performance limits for ChatGPT, extend its capability through restoration techniques, and design a robust pipeline for real-world applications. Firstly, ChatGPT capability evaluation was conducted to determine ChatGPT’s thresholds for reliable image analysis. Systematic testing revealed that a minimum resolution of 512px with 24-bit RGB color depth provided the most consistent balance between accuracy and efficiency. Performance dropped sharply when resolution was reduced to 256px or lower, while higher distortion levels of blur and noise above 15% significantly impaired accuracy. Further analysis showed that bar and line charts were more vulnerable to distortion than pie charts, highlighting differences in sensitivity across visualization types. These experiments established precise thresholds for resolution, color depth, blur, and noise, providing a baseline for effective and consistent analysis. Furthermore, the model preprocessing optimization and image quality restoration methods were explored to optimize good inputs and restore degraded inputs. For images already meeting the thresholds, optimization was performed to reduce computational load. The optimization is using an HSV-based background removal technique, where a saturation threshold of 15 effectively reduced file size while maintaining accuracy. For degraded inputs, an image restoration module was developed. Comparative testing demonstrated that PixelCut AI consistently outperformed DeepImage AI in deblurring, while a trained DnCNN model exceeded morphology-based approaches in denoising, particularly under high noise levels. These findings confirmed that advanced restoration techniques can extend ChatGPT’s capacity to analyse images that would otherwise fall below acceptable quality levels. Moreover, to optimize or restore the image, the implementation was realized through an integrated image processing pipeline designed to balance efficiency with reliability. The pipeline begins with quality evaluation to assess resolution, color depth, blur, and noise. Good-quality images are optimized to reduce computational load, while degraded inputs undergo restoration through resizing, deblurring, or denoising until they meet the minimum thresholds. A validation stage then ensures all processed images satisfy the required standards before being analysed by ChatGPT. This structure allows the system to optimize clear images while reliably enhancing poor-quality inputs, ensuring consistent results across varied conditions. The project culminated in the development of the ChatGPT AI Vision Assistant, implemented in Streamlit, which supports both text and image queries while integrating the image processing pipeline and a replot function for dynamic chart visualization. This system enables users to test quality thresholds, experience optimized analysis and observe the benefits of restoration methods. Overall, the project defines precise thresholds for resolution, color depth, blur, and noise, enhances degraded images with optimized and trained restoration models, and implements a complete pipeline that balances efficiency with reliability. Together, these outcomes deliver a robust framework that significantly strengthens ChatGPT’s reliability in real-world image analysis tasks. 2025-06 Final Year Project / Dissertation / Thesis NonPeerReviewed application/pdf http://eprints.utar.edu.my/7136/1/fyp_CS_2025_CM.pdf Chong, Michi (2025) Optimizing ChatGPT’s image analysis. Final Year Project, UTAR. http://eprints.utar.edu.my/7136/
spellingShingle	T Technology (General) Chong, Michi Optimizing ChatGPT’s image analysis
title	Optimizing ChatGPT’s image analysis
title_full	Optimizing ChatGPT’s image analysis
title_fullStr	Optimizing ChatGPT’s image analysis
title_full_unstemmed	Optimizing ChatGPT’s image analysis
title_short	Optimizing ChatGPT’s image analysis
title_sort	optimizing chatgpt’s image analysis
topic	T Technology (General)
url	http://eprints.utar.edu.my/7136/1/fyp_CS_2025_CM.pdf http://eprints.utar.edu.my/7136/
url_provider	http://eprints.utar.edu.my

Optimizing ChatGPT’s image analysis

Similar Items