| Faculty of Computers and Information

Discuss graduation projects for summer semester students

Read more about Discuss graduation projects for summer semester students

اعــــــــــلان هـــــــــــام

لمناقشة مشاريع التخرج الفصل الصيفي 2021 /2022

سيتم مناقشة مشاريع التخرج لطلاب الفصل الدراسي الصيفي للعام الجامعي 2021/2022 م يوم الاحد الموافق 25/9/2022 م 0

20 September 2022

Invite you to attend the discussion of the master's thesis of the researcher / Badriya Dahi

Read more about Invite you to attend the discussion of the master's thesis of the researcher / Badriya Dahi

7676

20 September 2022

Kohtd: Kazakh offline handwritten text dataset

Read more about Kohtd: Kazakh offline handwritten text dataset

Research Abstract

Despite the transition to digital information exchange, many documents, such as invoices, taxes, memos and questionnaires, historical data, and answers to exam questions, still require handwritten inputs. In this regard, there is a need to implement Handwritten Text Recognition (HTR) which is an automatic way to decrypt records using a computer. Handwriting recognition is challenging because of the virtually infinite number of ways a person can write the same message. For this proposal we introduce Kazakh handwritten text recognition research, a comprehensive dataset of Kazakh handwritten texts is necessary. This is particularly true given the lack of a dataset for handwritten Kazakh text. In this paper, we proposed our extensive Kazakh offline Handwritten Text dataset (KOHTD), which has 3000 handwritten exam papers and more than 140335 segmented images and there are approximately 922010 symbols

Research Authors

Nazgul Toiganbayeva, Mahmoud Kasem, Galymzhan Abdimanap, Kairat Bostanbekov, Abdelrahman Abdallah, Anel Alimova, Daniyar Nurseitov

Research Date

Sat, 16 Jul 2022 - 12:00

Research Department

Multimedia Department

Research Journal

Signal Processing: Image Communication

Research Member

Mahmoud Salah Eldin Mohammed

abdoelsayed2016

Research Year

2022

D-ARP: An Efficient Scheme to Detect and Prevent ARP Spoofing

Read more about D-ARP: An Efficient Scheme to Detect and Prevent ARP Spoofing

Research Abstract

Nowadays, cyber-attack is a severe criminal violation, and it is one of the most active fields of research. A Man-in-the-middle attack (MITM) is a type of cyber-attack in which an unauthorized third party secretly accesses the communication between two hosts in the same network to read/modify the transferred data between them. ARP spoofing-based MITM attack exploits ARP protocol weakness where the attacker associates its MAC address with the IP address of an intended legitimate host. Although there are many defense approaches for ARP spoofing based-MITM attacks, these methods are uncompleted or have a performance overhead since they modify the original ARP protocol. Also, some of these approaches depend on the centralized server, which is a single point of failure. This paper presents a detection scheme for ARP spoofing-based MITM attack called D-ARP, which is compatible with the original ARP protocol. The main idea of D-ARP is to send an ARP packet signed with a key in parallel with the original ARP packets to make a correlation between requests and replies. Each host records all types of signing ARP packets in a log file. Based on this correlation, D-ARP matches the injected key to detect ARP spoofing if there is a duplicate or conflict in the MAC address. For more reliability, D-ARP uses the DHCP server and the Nmap feature to detect the MAC addresses of MITM attackers. Moreover, this scheme also offers a module for Admin to create a trusted list of hosts. The experimental results show that D-ARP is highly effective for detecting and preventing ARP spoofing with zero false positives and zero false negative probabilities.

Research Authors

SABAH M. MORSY AND DALIA NASHAT

Research Date

Thu, 12 May 2022 - 12:00

Research Department

Information Technology Department

Research Member

Dalia Mohamed Nashat Mohamed

Automated Question Answer medical model based on Deep Learning Technology

Read more about Automated Question Answer medical model based on Deep Learning Technology

Research Abstract

Artificial intelligence can now provide more solutions for different problems, especially in the medical field. One of those problems is the lack of answers to any given medical/health-related question. The Internet is full of forums that allow people to ask some specific questions and get great answers for them. Nevertheless, browsing these questions to locate a similar case to your own question, also finding a satisfying accurate answer is difficult and timeconsuming task. This research will introduce a solution to these problems by automating the process of generating qualified answers to these questions and creating a kind of digital doctor. Furthermore, this research will train an end-to-end model using the framework of RNN and the encoder decoder to generate sensible and useful answers to a small set of medical/health issues. The proposed model was trained and evaluated using data from various online services …

Research Authors

Abdelrahman Abdallah, Mahmoud Kasem, Mohamed Hamada, Shaymaa Sdeek

Research Date

Thu, 21 May 2020 - 12:00

Research Department

Information Technology Department

Research Journal

ICEMIS'20: Proceedings of the 6th International Conference on Engineering & MIS 2020

Research Member

abdoelsayed2016

Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models

Read more about Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models

Research Abstract

This article discusses the problem of handwriting recognition in Kazakh and Russian languages. This area is poorly studied since in the literature there are almost no works in this direction. We have tried to describe various approaches and achievements of recent years in the development of handwritten recognition models in relation to Cyrillic graphics. The first model uses deep convolutional neural networks (CNNs) for feature extraction and a fully connected multilayer perceptron neural network (MLP) for word classification. The second model, called SimpleHTR, uses CNN and recurrent neural network (RNN) layers to extract information from images. We also proposed the Bluechet and Puchserver models to compare the results. Due to the lack of available open datasets in Russian and Kazakh languages, we carried out work to collect data that included handwritten names of countries and cities from 42 different Cyrillic words, written more than 500 times in different handwriting. We also used a handwritten database of Kazakh and Russian languages (HKR). This is a new database of Cyrillic words (not only countries and cities) for the Russian and Kazakh languages, created by the authors of this work.

Research Authors

Daniyar Nurseitov, Kairat Bostanbekov, Maksat Kanatov, Anel Alimova, Abdelrahman Abdallah, Galymzhan Abdimanap

Research Date

Thu, 10 Dec 2020 - 12:00

Research Department

Information Technology Department

Research Journal

Advances in Science, Technology and Engineering Systems Journal (ASTESJ)

Research Member

abdoelsayed2016

Attention-based fully gated cnn-bgru for russian handwritten text‏

Read more about Attention-based fully gated cnn-bgru for russian handwritten text‏

Research Abstract

This article considers the task of handwritten text recognition using attention-based encoder–decoder networks trained in the Kazakh and Russian languages. We have developed a novel deep neural network model based on a fully gated CNN, supported by multiple bidirectional gated recurrent unit (BGRU) and attention mechanisms to manipulate sophisticated features that achieve 0.045 Character Error Rate (CER), 0.192 Word Error Rate (WER), and 0.253 Sequence Error Rate (SER) for the first test dataset and 0.064 CER, 0.24 WER and 0.361 SER for the second test dataset. Our proposed model is the first work to handle handwriting recognition models in Kazakh and Russian languages. Our results confirm the importance of our proposed Attention-Gated-CNN-BGRU approach for training handwriting text recognition and indicate that it can lead to statistically significant improvements (p-value < 0.05) in the sensitivity (recall) over the tests dataset. The proposed method’s performance was evaluated using handwritten text databases of three languages: English, Russian, and Kazakh. It demonstrates better results on the Handwritten Kazakh and Russian (HKR) dataset than the other well-known models.

Research Authors

Abdelrahman Abdallah, Mohamed Hamada, Daniyar Nurseitov

Research Date

Wed, 1 Dec 2021 - 12:00

Research Department

Information Technology Department

Research Journal

Journal of Imaging

Research Member

abdoelsayed2016

Handwritten Kazakh and Russian (HKR) database for text recognition

Read more about Handwritten Kazakh and Russian (HKR) database for text recognition

Research Abstract

In this paper, we introduce a large-scale dataset, called HKR, to address challenging detection and recognition problems of handwritten Russian and Kazakh text in scanned documents. We present a new Russian and Kazakh database (with about 95% of Russian and 5% of Kazakh words/sentences respectively) for offline handwriting recognition. A few pre-processing and segmentation procedures have been developed together with the database. The database is written in Cyrillic and shares the same 33 characters. Besides these characters, the Kazakh alphabet also contains 9 additional specific characters. This dataset is a collection of forms. The sources of all the forms in the datasets were generated by LaTeXwhich subsequently was filled out by persons with their handwriting. The database consists of more than 1500 filled forms. There are approximately 63000 sentences, more than 715699 symbols produced by approximately 200 different writers. It can serve researchers in the field of handwriting recognition tasks by using deep and machine learning. For experiments, we used several popular text recognition methods for word and line recognition like CTC-based and attention-based methods. The results indicate the diversity of HKR. The dataset is available at https://github.com/abdoelsayed2016/HKR_Dataset.

Research Authors

Daniyar Nurseitov, Kairat Bostanbekov, Daniyar Kurmankhojayev, Anel Alimova, Abdelrahman Abdallah & Rassul Tolegenov

Research Date

Wed, 1 Sep 2021 - 12:00

Research Department

Information Technology Department

Research Journal

Multimedia Tools and Applications

Research Member

abdoelsayed2016

Ontology Based Natural Language Queries Transformation into SPARQL Queries

Read more about Ontology Based Natural Language Queries Transformation into SPARQL Queries

Research Authors

Majid ASKAR, Alsayed ALGERGAWY, Taysir Hassan A SOLIMAN, Birgitta KONIG-RIES, Adel A SEWISY

Research Date

Mon, 14 Dec 2020 - 12:00

Research Department

Computer Science Department

Research Journal

Baltic Journal of Modern Computing (BJMC)

Research Member

Majid Ahmed Jad EL-Rab El-Sayed Askar

Taiseer Hassan Abdel-Hamid Sulaiman

Adel Abo El-Magd Sewesy Mohamed

Research Pages

719–731

Research Vol

9

Research Year

2020

TNCR: Table Net Detection and Classification Dataset

Read more about TNCR: Table Net Detection and Classification Dataset

Research Abstract

We present TNCR, a new table dataset with varying image quality collected from open access websites. TNCR dataset can be used for table detection in scanned document images and their classification into 5 different classes. TNCR contains 9428 labeled tables with approximately 6621 images . In this paper, we have implemented state-of-the-art deep learning-based methods for table detection to create several strong baselines. Deformable DERT with Resnet-50 Backbone Network achieves the highest performance compared to other methods with a precision of 86.7%, recall of 89.6%, and f1 score of 88.1% on the TNCR dataset. We have made TNCR open source in the hope of encouraging more deep learning approaches to table detection, classification and structure recognition. The dataset and trained model checkpoints are available at https://github.com/abdoelsayed2016/TNCR_Dataset.

Research Authors

AbdelrahmanAbdallah, AlexanderBerendeyev, IslamNuradin, DaniyarNurseitov

Research Date

Tue, 14 Dec 2021 - 12:00

Research Department

Information Technology Department

Research Journal

Neurocomputing

Research Member

abdoelsayed2016

Research Rank

1

Research Year

2021