Hello, I’m Kemal!

I’m a Machine Learning Engineer specializing in creating intelligent models and scalable AI solutions. Passionate about solving real-world problems through advanced machine learning techniques.

Get In Touch

Whether you have a question, want to collaborate on a project, or just want to say hi, feel free to drop me a message. I’m always open to exciting opportunities!

📧 Email Me 💼 Connect on LinkedIn 🐙 GitHub

About Me

Welcome to my portfolio! I’m Kemal Bayık, a Machine Learning Engineer with a strong passion for solving complex problems using artificial intelligence. I hold a Master’s degree in Artificial Intelligence and Adaptive Systems from the University of Sussex, where I graduated with distinction. My MSc thesis, Self-Supervised Machine Learning for Predicting Cancer Dependencies, introduced two innovative models—VAE-DeepDep and MAE-DeepDep—that significantly improved predictions of cancer gene dependencies, uncovering valuable insights into therapeutic targets.

Throughout my academic and professional journey, I have developed expertise in machine learning, computer vision, and full-stack development. My projects have ranged from building CNN models for microscopy image analysis to creating optimized solutions for multi-objective problems and portfolio management. Additionally, my experience with tools like Flutter, React, and .NET has equipped me with a versatile skill set that spans both AI research and software development.

I am driven by a desire to bridge the gap between cutting-edge technology and impactful real-world applications. Whether it is exploring innovative solutions in AI or contributing to interdisciplinary collaborations, my goal is to make meaningful contributions to science and technology.

My Skills

Python

PyTorch

Keras

TensorFlow

Machine Learning

Deep Learning

Computer Vision

Image Processing

NLP

Genetic Algorithms

OOP

AWS

SQL

.Net

Git

Firebase

Flutter

Javascript

Typescript

React

Experience

Research Assistant

Hochegger Lab, University of Sussex

October 2024 - Present

Focused on developing Convolutional Neural Network (CNN) models for microscopy image analysis, particularly for classifying cell division stages. Collaborated with interdisciplinary teams to integrate AI-driven solutions into biological research workflows, enhancing research efficiency and accuracy.

Student Research Assistant

University of Sussex

January 2024 - August 2024

Contributed to an EPSRC-funded research project investigating desert ant vision for navigation tasks. Designed and optimized CNN models, improved preprocessing pipelines, and achieved significant performance gains in image analysis tasks.

Full Stack Developer

SPP42 International

November 2022 - June 2023

Developed a Natural Language Processing-based document retrieval system and integrated it into a scalable web application. Led the development of a mobile application using Flutter and built backend APIs using Python (Sanic framework) and .NET, delivering efficient and user-friendly solutions.

Full Stack Software Developer Intern

2C Information Technologies

June 2022 - October 2022

Designed and developed smart transportation and smart parking system applications using Flutter, React, and Firebase. Created a cryptocurrency and NFT wallet application, ensuring security and functionality for client needs.

Full Stack Software Developer Intern

Gekosis

January 2021 - May 2021

Developed a secure file storage mobile application with encryption features to protect sensitive data and user privacy. Successfully integrated Flutter and Firebase for cross-platform functionality.

Front End Developer Intern

BiSoft Information Technologies

May 2020 - August 2020

Contributed to web development projects using Angular and React, integrating backend services with NodeJS. Delivered responsive and interactive user interfaces for web applications, improving user experience and performance.

Projects

Self-Supervised Learning for Cancer Dependencies

In my master’s thesis, I developed two novel deep learning models, VAE-DeepDep and MAE-DeepDep, to improve the performance of the existing DeepDEP model in predicting gene dependencies in cancer cells. VAE-DeepDep, utilizing variational autoencoders, demonstrated the highest prediction accuracy, while MAE-DeepDep, based on masked autoencoders, also outperformed the original model. Extensive datasets, including TCGA, DepMap, and MSigDB, were used for training and validation, ensuring robust evaluation of the models. Key parameters such as the beta-value in VAE-DeepDep and the mask ratio in MAE-DeepDep were optimized, with minimal performance differences observed across variations. Input dropout analysis revealed that fingerprint data contributed the most to model performance, emphasizing its importance in cancer dependency prediction. Synthetic lethality analyses of four genes provided insights into potential therapeutic targets, further advancing the applicability of the models in cancer research.

Comparative Analysis of DeBERTaV3, LSTM with BERT Word Embeddings, and SVM in Propaganda Detection

In this study, three different language models were run in two different propaganda detection tasks and the results were compared. The models used are DeBERTaV3, LSTM with BERT Word Embeddings (LBWE) and SVM. By selecting these models, a transformer-based model, a model whose word embedding is provided by a transformer-based model but uses LSTM in classifica tion, and a traditional machine learning model that works with word2vec word embeddings were compared. The first task to which these models are applied is to classify whether the sentence contains propaganda or not. The second task is to classify which type of propaganda the sentences containing propaganda contain. In the results obtained, while DeBERTaV3 showed the best performance in the first task, the LBWE model showed the best performance in the second task. In both tasks, SVM achieved the lowest F1 and accuracy scores.

Comparative Analysis of GDE3 and NSGA-II Algorithms on Multi-Objective Capacitated Facility Location Problem

In this project, the GDE3 and NSGA-II algorithms were applied and compared on the Multi-Objective Capacitated Facility Location Problem (MOCFLP). The study aimed to minimize logistics costs and CO2 emissions, revealing that while the GDE3 algorithm performed faster, the NSGA-II algorithm generally achieved more diverse and successful results. The project is significant as it marks the first application of GDE3 to MOCFLP and provides detailed insights into the performance of both algorithms.

Stock Portfolio Optimization Using Genetic Algorithm

In this project, stock portfolio optimization was performed using a Genetic Algorithm (GA) on 15 selected S&P 500 stocks to maximize the Sharpe ratio. The portfolio, tested with 2023 stock prices, achieved a return approximately 2.55 times higher than the S&P 500 index, demonstrating successful results. Additionally, the effects of crossover rate, mutation rate, and population size on convergence time were analyzed, revealing that reducing the mutation rate significantly increases the time to reach the optimal solution.

Comparative Analysis of Connection Patterns in High-Unit Homeostat

In this project, six different connection patterns were tested to determine the most efficient design for high-unit-number homeostats, focusing on adaptation time, complexity, and robustness. The analysis showed that sparse connectivity, importance-based connectivity, and the Watts-Strogatz model performed similarly with lower complexity than the fully connected pattern, which was the most complex. Sparse connectivity was the most robust for 4-7 and 9 units, importance-based connectivity excelled at 8 units, and the Watts-Strogatz model was the most robust at 10 units, while importance-based connectivity outperformed others in adaptation time for high unit numbers.