Search Results

Showing 1 - 6 of 6
  • Publication
    A robust Gradient boosting model based on SMOTE and NEAR MISS methods for intrusion detection in imbalanced data sets
    (Işık Üniversitesi, 2022-01-18) Arık, Ahmet Okan; Çavdaroğlu Akkoç, Gülsüm Çiğdem; Işık Üniversitesi, Lisansüstü Eğitim Enstitüsü, Enformasyon Teknolojileri Yüksek Lisans Programı
    Novel technologies introduce many security vulnerabilities and zero-day attack risks. Intrusion Detection Systems (IDS) are developed to protect computer networks from threats and attacks, but existing methods still leave many challenging problems unsolved. The class imbalance problem is one of the most difficult of these, and it reduces the detection rate of classifiers. The highest IDS detection rate reported in the literature is 96.54%. This thesis proposes a new model called ROGONG-IDS (Robust Gradient Boosting), based on gradient boosting. The ROGONG-IDS model uses the Synthetic Minority Over-Sampling Technique (SMOTE) and Near Miss methods to handle class imbalance. Three gradient boosting-based classification algorithms (GBM, LightGBM, XGBoost) were compared. The performance of the proposed model on multiclass classification was verified on the UNSW-NB15 dataset. It reached the highest attack detection rate and F1 score in the literature, with a 97.30% detection rate and a 97.65% F1 score. ROGONG-IDS provides a robust, efficient solution for IDS built on datasets with an imbalanced class distribution. It outperforms state-of-the-art and traditional intrusion detection methods.
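The two resampling ideas named in this abstract can be illustrated with a minimal, dependency-free sketch. It is not the ROGONG-IDS implementation: the function names `smote` and `near_miss`, the toy data, and the parameter defaults are all assumptions made for illustration. SMOTE is shown as interpolation between a minority sample and one of its nearest minority neighbours, and Near Miss as the variant that keeps the majority samples closest on average to their nearest minority samples.

```python
import random


def dist(p, q):
    """Euclidean distance between two feature vectors."""
    return sum((a - b) ** 2 for a, b in zip(p, q)) ** 0.5


def smote(minority, n_new, k=3, seed=0):
    """Oversample: create n_new synthetic minority points (hypothetical helper)."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the chosen base sample
        others = sorted((p for p in minority if p is not base),
                        key=lambda p: dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position on the segment base -> neighbour
        synthetic.append([a + gap * (b - a) for a, b in zip(base, neighbour)])
    return synthetic


def near_miss(majority, minority, n_keep, k=3):
    """Undersample: keep the n_keep majority points with the smallest
    average distance to their k nearest minority points (hypothetical helper)."""
    def avg_dist(p):
        nearest = sorted(dist(p, m) for m in minority)[:k]
        return sum(nearest) / len(nearest)
    return sorted(majority, key=avg_dist)[:n_keep]


# Toy data, invented for this example only.
minority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
majority = [[1.0, 1.2], [5.0, 5.0], [6.0, 6.0], [1.5, 1.0], [9.0, 9.0]]

synthetic = smote(minority, n_new=6)
kept = near_miss(majority, minority, n_keep=2)
```

In practice the resampled training set would then be passed to a gradient boosting classifier; that step is omitted here because it depends on an external library.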
  • Publication
    Rule based entity-relationship diagram modelling
    (Işık Üniversitesi, 2022-02-07) Ulusoy, Oğuzhan; Ekin, Emine; Işık Üniversitesi, Lisansüstü Eğitim Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı
    Modern society relies on database systems, since many everyday activities involve direct interaction with a database. This study presents entity-relationship modeling for the English language using Natural Language Processing techniques. Natural Language Processing refers to the capability of understanding human languages, such as Turkish and English, using computational power. To make this possible, linguistics is combined with current Machine Learning systems. Entity-Relationship diagrams make it possible to plan or trace relational databases in different fields. First, the details of a standard database management system and its components were studied. Heuristic rules that indicate the relation between human language and database components were then defined. Based on these heuristic rules, an event-based pipeline was constructed. In this pipeline, a full text is analyzed and every word is processed using Natural Language Processing techniques.
  • Publication
    Fingertip electrocardiogram and speech signal based biometric recognition system
    (Işık Üniversitesi, 2021-12-27) Güven, Gökhan; Güz, Ümit; Gürkan, Hakan; Işık Üniversitesi, Lisansüstü Eğitim Enstitüsü, Elektronik Mühendisliği Doktora Programı
    In this research work, we present a one-dimensional CNN-based person identification system that combines speech and ECG modalities to improve overall performance compared to traditional systems. The proposed method has two approaches: the first is to develop a fusion system combining text-independent speech and fingertip ECG; the second is to develop a robust rejection algorithm to prevent unauthorized access to the fusion system. To improve system robustness, we also developed an algorithm for removing ECG spikes and inconsistent beats, which detects and removes the problems caused by either portable fingertip ECG devices or movements of the patients. The first approach was tested on 30, 45, 60, 75 and 90 people taken from the LibriSpeech Corpus database and from a combination of CYBHi and our private fingertip ECG database. A 3-fold cross-validation test setup was used, with the system working time set to 10 seconds. In the first experiment, the ECG-based system achieved a 90.22% accuracy rate for 90 people. For the speech-based system, a 97.94% accuracy rate was achieved for 90 people. For the combination of both systems, a 99.92% accuracy rate was achieved. For the second approach, 90 people from the ECG and speech databases were used as the genuine class and 26 people as the imposter class. After performance evaluation at the optimum rejection thresholds, the ECG-based system achieved 71.08% accuracy for imposter rejection and 71.05% accuracy for genuine recognition. For the speech-based system, the imposter class was rejected with 87.82% accuracy, while the genuine class was identified with 86.48% accuracy. The combination of both systems achieved 91.68% accuracy for genuine identification and 96.05% accuracy for imposter rejection.
  • Publication
    Learning to rank
    (Işık Üniversitesi, 2011-04-28) Kılıç, Yasin Ozan; Yıldız, Olcay Taner; Işık Üniversitesi, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı
    The web has grown rapidly in the last decade, which has brought the need for proper ranking. Learning to rank (LTR) is the collection of machine learning technologies that construct a ranking model using training data. The model can sort documents according to their degrees of relevance or preference. In this thesis, we introduce LTR technologies and divide them into three approaches: point-wise, pair-wise and list-wise. We review the theoretical aspects of each category and introduce its representative algorithms. We also introduce a new LTR method, GRwC, which uses classification and graph algorithms. We reduce the ranking problem to a two-class classification problem and apply the KNN algorithm to a modified LTR dataset. We compare it with the popular ranking algorithm RankingSVM. Experiments on well-known ranking datasets show that our proposed method gives slightly worse results than RankingSVM.
  • Publication
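The reduction of ranking to two-class classification mentioned in this abstract can be sketched with the standard pair-wise transform. This is only an illustration of the general idea, not the GRwC method itself, whose dataset modification is not described here; the function name `pairwise_transform`, the tuple layout, and the toy documents are assumptions. Each pair of documents from the same query with different relevance becomes one training example whose features are the difference of the two feature vectors and whose label says which document should rank higher.

```python
from itertools import combinations


def pairwise_transform(docs):
    """Turn (query_id, features, relevance) records into two-class examples.

    Returns a list of (feature_difference, label) where label is +1 if the
    first document of the pair is more relevant and -1 otherwise.
    """
    pairs = []
    for (q1, x1, r1), (q2, x2, r2) in combinations(docs, 2):
        # Only documents of the same query with different relevance are comparable.
        if q1 != q2 or r1 == r2:
            continue
        diff = [a - b for a, b in zip(x1, x2)]
        pairs.append((diff, 1 if r1 > r2 else -1))
    return pairs


# Toy LTR data, invented for this example only.
docs = [
    ("q1", [0.9, 0.2], 2),
    ("q1", [0.4, 0.1], 1),
    ("q1", [0.1, 0.0], 0),
    ("q2", [0.7, 0.5], 1),
    ("q2", [0.2, 0.3], 0),
]
pairs = pairwise_transform(docs)
```

Any two-class classifier, including a nearest-neighbour one, could then be trained on the resulting examples.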
    Word sense disambiguation, named entity recognition, and shallow parsing tasks for Turkish
    (Işık Üniversitesi, 2019-04-02) Topsakal, Ozan; Yıldız, Olcay Taner; Işık Üniversitesi, Fen Bilimleri Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı
    Human interaction is based on sentences. Understanding a sentence requires parsing its words and making sense of them. The ultimate goal of Natural Language Processing is to understand the meaning of sentences. This thesis covers three main areas: Named Entity Recognition, Shallow Parsing, and Word Sense Disambiguation. Natural Language Processing algorithms that learn entities such as person, location, and time are called Named Entity Recognition algorithms. Parsing sentences is one of the biggest challenges in Natural Language Processing. Since time efficiency and accuracy trade off against each other, one of the best ideas is to use shallow parsing algorithms to deal with this challenge. Many words have more than one meaning, and recognizing the meaning that is used in a given sentence is a difficult problem. The Word Sense Disambiguation literature offers many algorithms that can help to solve it. This thesis tries to find solutions to these three challenges by applying trained machine learning algorithms. Experiments are carried out on a dataset containing 9,557 sentences.
  • Publication
    Supervised decision making in forex investment using ML and DL classification methods
    (Işık Üniversitesi, 2023-07-20) Jiroudi, Abdullah; Eskil, Mustata Taner; Işık Üniversitesi, Lisansüstü Eğitim Enstitüsü, Bilgisayar Mühendisliği Yüksek Lisans Programı; Işık University, School of Graduate Studies, Master’s Program in Computer Engineering
    The suggested trading system offers an approach that takes into account the complexity and high trading volume of the foreign exchange (FX) market. Its main objective is to address the challenges faced by traders in the GBP/JPY currency pair and assist them in making quick decisions. To achieve this, machine learning and deep learning techniques are integrated to propose a trading algorithm. The proposed algorithm works by combining data from different time intervals. The Long Short-Term Memory (LSTM) model is used to predict indicator values, while the XGBoost classifier is employed to determine trading decisions. This method aims to adapt to rapidly changing patterns in the forex market and enables the detection of subtle changes in price dynamics through a sliding window training approach. Experiments have shown promising results for the suggested trading system, with positive outcomes in terms of capital growth and prediction accuracy. However, since the method is highly risky and requires further development in risk management, the inclusion of risk management techniques and algorithm optimization is targeted as future work. This study contributes to the improvement of trading strategies while bridging the gap between researchers and traders. It also demonstrates the potential of machine learning and deep learning techniques to enhance decision-making processes in financial markets. The trading system offers traders a range of advantages. The use of machine learning and deep learning techniques enables rapid analysis of large amounts of data and fast decision-making. Additionally, by combining data from different time intervals, it becomes possible to evaluate long-term trends and short-term fluctuations more effectively. In conclusion, the suggested trading system empowers traders to be competitive in the forex market and achieve better outcomes. Furthermore, it contributes to the increased use of machine learning and deep learning techniques in financial markets and encourages further research in the field.
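The sliding window training approach mentioned in this abstract can be sketched as follows. This is a generic illustration under stated assumptions, not the thesis's setup: the function name `sliding_windows`, the window sizes, and the toy price series are all hypothetical. The model would be retrained on each `train` slice and evaluated on the `test` slice that immediately follows it, so that it keeps adapting to the most recent data.

```python
def sliding_windows(series, train_size, test_size, step):
    """Yield (train, test) slices that move forward through the series."""
    start = 0
    while start + train_size + test_size <= len(series):
        train = series[start:start + train_size]
        test = series[start + train_size:start + train_size + test_size]
        yield train, test
        start += step  # slide the whole window forward in time


# Toy price series, invented for this example only.
prices = [float(i) for i in range(10)]
windows = list(sliding_windows(prices, train_size=4, test_size=2, step=2))
```

Choosing `step` equal to `test_size` makes the test slices tile the series without overlap, which keeps every evaluation point strictly out of sample.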