Işık Üniversitesi Kurumsal Akademik Belleği :: DSpace Angular

Arama Sonuçları

Listeleniyor 1 - 10 / 11

Developing an efficient deep neural network for automatic detection of COVID-19 using chest X-ray images
(Elsevier B.V., 2021-06) Sheykhivand, Sobhan; Mousavi, Zohreh; Mojtahedi, Sina; Yousefi Rezaii, Tohid; Farzamnia, Ali; Meshgini, Saeed; Saad, Ismail
The novel coronavirus (COVID-19) could be described as the greatest human challenge of the 21st century. The development and transmission of the disease have increased mortality in all countries. Therefore, a rapid diagnosis of COVID-19 is necessary to treat and control the disease. In this paper, a new method for the automatic identification of pneumonia (including COVID-19) is presented using a proposed deep neural network. In the proposed method, the chest X-ray images are used to separate 2–4 classes in 7 different and functional scenarios according to healthy, viral, bacterial, and COVID-19 classes. In the proposed architecture, Generative Adversarial Networks (GANs) are used together with a fusion of the deep transfer learning and LSTM networks, without involving feature extraction/selection for classification of pneumonia. We have achieved more than 90% accuracy for all scenarios except one and also achieved 99% accuracy for separating COVID-19 from healthy group. We also compared our deep proposed network with other deep transfer learning networks (including Inception-ResNet V2, Inception V4, VGG16 and MobileNet) that have been recently widely used in pneumonia detection studies. The results based on the proposed network were very promising in terms of accuracy, precision, sensitivity, and specificity compared to the other deep transfer learning approaches. Depending on the high performance of the proposed method, it can be used during the treatment of patients.
CNN-Based deep learning architecture for electromagnetic imaging of rough surface profiles
(IEEE, 2022-10) Aydın, İzde; Budak, Güven; Sefer, Ahmet; Yapar, Ali
A convolutional neural network (CNN) based deep learning (DL) technique for electromagnetic imaging of rough surfaces separating two dielectric media is presented. The direct scattering problem is formulated through the conventional integral equations and the synthetic scattered field data is produced by a fast numerical solution technique which is based on Method of Moments (MoM). Two different special CNN architectures are designed and implemented for the solution of the inverse rough surface imaging problem wherein both random and deterministic rough surface profiles can be imaged. It is shown by a comprehensive numerical analysis that the proposed deep-learning (DL) inversion scheme is very effective and robust.
Adaptive convolution kernel for artificial neural networks
(Academic Press Inc., 2021-02) Tek, Faik Boray; Çam, İlker; Karlı, Deniz
Many deep neural networks are built by using stacked convolutional layers of fixed and single size (often 3 × 3) kernels. This paper describes a method for learning the size of convolutional kernels to provide varying size kernels in a single layer. The method utilizes a differentiable, and therefore backpropagation-trainable Gaussian envelope which can grow or shrink in a base grid. Our experiments compared the proposed adaptive layers to ordinary convolution layers in a simple two-layer network, a deeper residual network, and a U-Net architecture. The results in the popular image classification datasets such as MNIST, MNIST-CLUTTERED, CIFAR-10, Fashion, and ‘‘Faces in the Wild’’ showed that the adaptive kernels can provide statistically significant improvements on ordinary convolution kernels. A segmentation experiment in the Oxford-Pets dataset demonstrated that replacing ordinary convolution layers in a U-shaped network with 7 × 7 adaptive layers can improve its learning performance and ability to generalize.
Closeness and uncertainty aware adversarial examples detection in adversarial machine learning
(Elsevier Ltd, 2022-07) Tuna, Ömer Faruk; Çatak, Ferhat Özgür; Eskil, Mustafa Taner
While deep learning models are thought to be resistant to random perturbations, it has been demonstrated that these architectures are vulnerable to deliberately crafted perturbations, albeit being quasi-imperceptible. These vulnerabilities make it challenging to deploy Deep Neural Network (DNN) models in security-critical areas. Recently, many research studies have been conducted to develop defense techniques enabling more robust models. In this paper, we target detecting adversarial samples by differentiating them from their clean equivalents. We investigate various metrics for detecting adversarial samples. We first leverage moment-based predictive uncertainty estimates of DNN classifiers derived through Monte-Carlo (MC) Dropout Sampling. We also introduce a new method that operates in the subspace of deep features obtained by the model. We verified the effectiveness of our approach on different datasets. Our experiments show that these approaches complement each other, and combined usage of all metrics yields 99 % ROC-AUC adversarial detection score for well-known attack algorithms.
Recovery of impenetrable rough surface profiles via CNN-based deep learning architecture
(Taylor and Francis Ltd., 2022-08-18) Aydın, İzde; Budak, Güven; Sefer, Ahmet; Yapar, Ali
In this paper, a convolutional neural network (CNN)-based deep learning (DL) architecture for the solution of an electromagnetic inverse problem related to imaging of the shape of the perfectly electric conducting (PEC) rough surfaces is addressed. The rough surface is illuminated by a plane wave and scattered field data is obtained synthetically through the numerical solution of surface integral equations. An effective CNN-DL architecture is implemented through the modelling of the rough surface variation in terms of convenient spline type base functions. The algorithm is numerically tested with various scenarios including amplitude only data and shown that it is very effective and useful.
Uncertainty as a Swiss army knife: new adversarial attack and defense ideas based on epistemic uncertainty
(Springer, 2022-04-02) Tuna, Ömer Faruk; Çatak, Ferhat Özgür; Eskil, Mustafa Taner
Although state-of-the-art deep neural network models are known to be robust to random perturbations, it was verified that these architectures are indeed quite vulnerable to deliberately crafted perturbations, albeit being quasi-imperceptible. These vulnerabilities make it challenging to deploy deep neural network models in the areas where security is a critical concern. In recent years, many research studies have been conducted to develop new attack methods and come up with new defense techniques that enable more robust and reliable models. In this study, we use the quantified epistemic uncertainty obtained from the model's final probability outputs, along with the model's own loss function, to generate more effective adversarial samples. And we propose a novel defense approach against attacks like Deepfool which result in adversarial samples located near the model's decision boundary. We have verified the effectiveness of our attack method on MNIST (Digit), MNIST (Fashion) and CIFAR-10 datasets. In our experiments, we showed that our proposed uncertainty-based reversal method achieved a worst case success rate of around 95% without compromising clean accuracy.
Exploiting epistemic uncertainty of the deep learning models to generate adversarial samples
(Springer, 2022-03) Tuna, Ömer Faruk; Çatak, Ferhat Özgür; Eskil, Mustafa Taner
Deep neural network (DNN) architectures are considered to be robust to random perturbations. Nevertheless, it was shown that they could be severely vulnerable to slight but carefully crafted perturbations of the input, termed as adversarial samples. In recent years, numerous studies have been conducted in this new area called ``Adversarial Machine Learning” to devise new adversarial attacks and to defend against these attacks with more robust DNN architectures. However, most of the current research has concentrated on utilising model loss function to craft adversarial examples or to create robust models. This study explores the usage of quantified epistemic uncertainty obtained from Monte-Carlo Dropout Sampling for adversarial attack purposes by which we perturb the input to the shifted-domain regions where the model has not been trained on. We proposed new attack ideas by exploiting the difficulty of the target model to discriminate between samples drawn from original and shifted versions of the training data distribution by utilizing epistemic uncertainty of the model. Our results show that our proposed hybrid attack approach increases the attack success rates from 82.59% to 85.14%, 82.96% to 90.13% and 89.44% to 91.06% on MNIST Digit, MNIST Fashion and CIFAR-10 datasets, respectively.
Sarcasm detection on news headlines using transformers
(Springer, 2025-09-07) Gümüşçekiçci, Gizem; Dehkharghani, Rahim
Sarcasm poses a linguistic challenge due to its figurative nature, where intended meaning contradicts literal interpretation. Sarcasm is prevalent in human communication, affecting interactions in literature, social media, news, e-commerce, etc. Identifying the true intent behind sarcasm is challenging but essential for applications in sentiment analysis. Detecting sarcasm in written text, as a challenging task, has attracted many researchers in recent years. This paper attempts to detect sarcasm in news headlines. Journalists prefer using sarcastic news headlines as they seem much more interesting to the readers. In the proposed methodology, we experimented with Transformers, namely the BERT model, and several Machine and Deep Learning models with different word and sentence embedding methods. The proposed approach inherently requires high-performance resources due to the use of large-scale pre-trained language models such as BERT. We also extended an existing news headlines dataset for sarcasm detection using augmentation techniques and annotating it with hand-crafted features. The proposed methodology could outperform almost all existing sarcasm detection approaches with a 98.86% F1-score when applied to the extended news headlines dataset, which we made publicly available on GitHub.
Automated diagnosis of Alzheimer’s Disease using OCT and OCTA: a systematic review
(Institute of Electrical and Electronics Engineers Inc., 2024-08-06) Turkan, Yasemin; Tek, Faik Boray; Arpacı, Fatih; Arslan, Ozan; Toslak, Devrim; Bulut, Mehmet; Yaman, Aylin
Retinal optical coherence tomography (OCT) and optical coherence tomography angiography (OCTA) have emerged as promising, non-invasive, and cost-effective modalities for the early diagnosis of Alzheimer's disease (AD). However, a comprehensive review of automated deep learning techniques for diagnosing AD or mild cognitive impairment (MCI) using OCT/OCTA data is lacking. We addressed this gap by conducting a systematic review using the Preferred Reporting Items for Systematic Review and Meta-Analysis (PRISMA) guidelines. We systematically searched databases, including Scopus, PubMed, and Web of Science, and identified 16 important studies from an initial set of 4006 references. We then analyzed these studies through a structured framework, focusing on the key aspects of deep learning workflows for AD/MCI diagnosis using OCT-OCTA. This included dataset curation, model training, and validation methodologies. Our findings indicate a shift towards employing end-to-end deep learning models to directly analyze OCT/OCTA images in diagnosing AD/MCI, moving away from traditional machine learning approaches. However, we identified inconsistencies in the data collection methods across studies, leading to varied outcomes. We emphasize the need for longitudinal studies on early AD and MCI diagnosis, along with further research on interpretability tools to enhance model accuracy and reliability for clinical translation.
Text-to-SQL: a methodical review of challenges and models
(TÜBİTAK, 2024-05-20) Kanburoğlu, Ali Buğra; Tek, Faik Boray
This survey focuses on Text-to-SQL, automated translation of natural language queries into SQL queries. Initially, we describe the problem and its main challenges. Then, by following the PRISMA systematic review methodology, we survey the existing Text-to-SQL review papers in the literature. We apply the same method to extract proposed Text-to-SQL models and classify them with respect to used evaluation metrics and benchmarks. We highlight the accuracies achieved by various models on Text-to-SQL datasets and discuss execution-guided evaluation strategies. We present insights into model training times and implementations of different models. We also explore the availability of Text-to-SQL datasets in non-English languages. Additionally, we focus on large language model (LLM) based approaches for the Text-to-SQL task, where we examine LLM-based studies in the literature and subsequently evaluate the LLMs on the cross-domain Spider dataset. Finally, we conclude with a discussion of future directions for Text-to-SQL research, identifying potential areas of improvement and advancements in this field.

Filtreler

Yazar

Konu

Tarih

İndeks

WoS Q

Scopus Q

Dil

Tür

Kategori

Bölüm

Erişim Hakkı

Tam Metin

Öğe Türü

Ayarlar

Sırala

Sayfa Başına Sonuç

Arama Sonuçları