Avatar

Didi Milikina

Motivated and open-minded graduate of Artificial Intelligence.
Experienced in working with different cultures and solving diverse data and engineering problems. I combine analytical thinking, grit and passion in my craft.


Amsterdam, the Netherlands

d.a.milikina@gmail.com

+31 6 34128258

GitHub | DataCamp | HackerRank | Kaggle | LinkedIn


Professional Skills

Programming
Python, C#

Data Visualisation and Manipulation
Python(Pandas, NumPy, PyTorch, Tensorflow, Matplotlib, Scikit-learn), SQL, Tableau, Microsoft Power BI

Machine Learning and AI
Supervised and Unsupervised Algorithms, Clustering & Classification

Version Control
Git

Front-end development with React.js
JS, HTML5 and CSS3

Team and Product management
Trello, Azure DevOps, Jira


Languages

Bulgarian

Mother language

English

C1 Level (Fluent)

Spanish

B2 Level


Strengths

  • Focus and Perservierence
  • Openness to feedback
  • Ability to take end-to-end ownership of problems
  • Teamwork
  • Creativity
  • Empathy
  • Strategic thinking
  • Adaptability and Embracing change

Interests

  • Machine Learning and AI
  • Mathematics
  • Statistics
  • Team and Project management
  • Organizing and managing events

Work Experience

Data Scientist - Pricing | Otrium | July 2023 - January 2024

Amsterdam, the Netherlands

Otrium is a Netherlands-based fashion technology company that operates an online platform that sells fashion items.

ℹī¸ About the company
The company focuses on creating a sustainable and circular fashion economy by partnering with brands to sell excess or unsold inventory through its platform. Otrium allows brands to reach a larger audience for their products while also addressing the issue of overproduction in the fashion industry.

✅ Role and key deliverables

  • Pricing demand forecast model
    • I achieved an 18.42% improvement in the RMSE score of Otrium's pricing forecast model through strategic changes and techniques. This model is pivotal in the Smart Pricing system, which alters product discounts based on individual product performance.
    • Conducted Comprehensive Model Analysis
    • Implemented Data Transformation Techniques
    • Refined Model Features and Parameters
  • I worked directly with brands such as ASICS, Reiss, Guess, Pepe Jeans and others, providing them with data-driven reports on brand performance, including tailored reports on 24-hour discount deals for exclusive brands, and translated data findings into comprehensible insights for non-technical stakeholders.
  • I collaborated closely with diverse cross-functional teams, including Partner Success Managers (PSMs), business analysts, and other stakeholders. This interaction allowed me to understand their data requirements and deliver robust analytical support.
🛠ī¸ Tools and Technologies
  • Databricks
  • Bitbucket
  • Looker
  • Looker Studio
  • SQL
  • Python
  • Excel/Google Sheets


Artificial Intelligence Intern | GrwNxt | February 2021 - May 2021

Amsterdam, the Netherlands

GrwNxt is a startup Food Tech company that aims to create solutions for autonomous indoor farming.

✅ Role and key deliverables

  • Detected each crop from the growth room and identified each species. The plants were located in special rooms with various sensors, cameras and lighting. The data was transferred, stored, and analyzed. Based on the results of the analysis, further actions were taken if needed.
  • I introduced my work in-depth in the company's final report, which took an important role in the progress of GrwNxt.
  • I implemented solutions that would be incorporated into a system that could detect each leaf, analyze its health, and further define each crop's growth stages to develop specialized Dynamic Digital Recipes for each species.

🛠ī¸ Tools and Technologies
In GrwNxt, we used the Microsoft ecosystem
  • Azure Blob for structured and unstructured data
  • Azure Machine Learning Studio for building, deploying and training models
  • Azure SQL Database
  • Excel


Consultant | deeplearning.ai | April 2020 - December 2020

Amsterdam, the Netherlands

My job as a Consultant at DeepLearning.ai was to review the videos and test the notebooks from the specialisations before publishing them in Coursera. Additionally, I discussed necessary edits regarding the content with the corresponding teams. The mistakes and suggestions were added to Trello boards which were reviewed by the editors. I had worked on the NLP track, the TensorFlow track and partially on the GAN's track.
My goal was to improve the notebooks, which consisted of expressing the requirements in an understandable way for the end-user and adding flowcharts or hints whenever needed.


Sofia Ambassador | Coding Girls | October 2018 - April 2020

Sofia, Bulgaria

Being an ambassador of Coding Girls enabled me to organise and facilitate tech-related events, be a speaker, and take part in important IT events. The events which I organised were for different age groups. Most of them were for women, but I also organised workshops for kids. Supporting more girls to follow their dreams was one of my main objectives. Companies that I worked with were Uber, Leanplum and MindHub, and I attended many events and conferences as an ambassador to represent the organisation and its values.


Internship - Front-end Developer | DevriX | June - July 2018

Sofia, Bulgaria

At DevriX I made plugin for WordPress, used WordPress admin panel for adding new info for sites. Updated some of the components which were in different projects and implemented new ones. Used Sass (Sassy CSS) and JS. During the intern, I accumulate basic knowledge of PHP which we were used for back-end for our plugin.


Internship - Front-end Developer | JBoxers | July - September 2018

Sofia, Bulgaria

At JBoxers I improved my personal JS and React skills with different courses from egghead.io and Udemy. I did small projects to consolidate what I learned and also applied it in the company.


Education

BSc Artificial Intelligence - Vrije Universiteit Amsterdam - GPA 7.6/10

2020 - 2023

Bachelor's Degree: Artificial Intelligence
The Artificial Intelligence bachelor's degree at Vrije Universiteit helped me raise awareness about robots' ethical concerns when developed improperly, owing to the psychology and ethics courses included in the programme. In addition, logic and sets, programming and the core courses in machine learning and text mining helped me better understand and unfold critical and logical thinking about the programming problems and the possible approaches for resolving them.

Specialisation: Intelligent Systems
In addition to the main program that included statistics, linear algebra and databases, the specialisation helped me thoroughly learn data structures and algorithms, machine learning and computational intelligence.

Minor: Applied Econometrics
A detailed introduction to econometric methods and techniques with an emphasis on how to implement and carry out the methods in empirical studies and how to interpret the results. Furthermore, the courses included in the minor examine the stages of model formulation, parameter estimation, diagnostic checking, hypothesis testing, model selection and empirical analysis.

Thesis: Enhanced Sequence-to-Sequence Machine Text Translation: Investigating the Benefits of Attention mechanism and Attention mechanism with GRUs
Scientific paper
The paper I authored investigates the application of Recurrent Neural Networks (RNNs) with attention mechanisms for English-to-French sequence-to-sequence machine text translation. It introduces two transformer models: one using pure attention and the other combining attention with Gated Recurrent Units (GRUs) in an encoder-decoder architecture. Drawing inspiration from "Attention Is All You Need," the study differs by not employing teacher-forcing techniques. Translation performance evaluation using the BiLingual Evaluation Understudy (BLEU) score reveals enhanced results for both transformers with a low learning rate and deeper encoder-decoder blocks.

Language and main library: Python, PyTorch
Visualisation and experimentation tracking: Weights & Biases



Language High-School "22 Georgi Rakovski" in Sofia, Bulgaria

2014 - 2019
Website

I studied Spanish and Spanish culture in addition to math, science, geography, history and biology.

Specializations

Natural Language Processing / deeplearning.ai

Through the specialization, I learned how to design applications that perform question-answering and sentiment analysis, summarise a text and build a chatbot.
In the first course, I performed sentiment analysis of tweets using logistic regression and Naive Bayes, used vector spaces to discover the relationship between words, and wrote a simple English-to-French translation algorithm. In the second course, I applied the Viterbi algorithm for part-of-speech tagging and wrote my own Word2Vec model, which uses a neural network to compute word embeddings. In the third course, I generated synthetic Shakespeare text using a GRU language model and trained a recurrent neural network to perform NER using LTSMs with linear layers. Finally, in the fourth course, I learned how to translate whole sentences using the encoder-decoder attention model and built a Transformer to summarise a text.

Certificates


DeepLearning.AI TensorFlow Developer / deeplearning.ai

Completing all four courses of the DeepLearning.AI TensorFlow Developer successfully enhanced my previous knowledge in TensorFlow. Through the courses, I explored more strategies to prevent overfitting, including augmentation and dropout. I applied transfer learning, extracted learned features from models, and built models with LSTMs, GRUs, and RNNs layers in TensorFlow. Trained LSTMs on existing text to generate poetry based on Shakespeare's poems. Last but not least, I explored how to prepare real-world data and use it to build a prediction model.

Certificates


Deep Learning / deeplearning.ai

This specialization helped me build a solid foundation for my understanding of Machine and Deep Learning. It helped me to learn the best practices and gave me many insights from recognized and influential people in the AI field. It also prepared me for getting more into the depths of different applications and approaches towards Machine Learning.

Certificates


Python Programmer / DataCamp

By completing this career track, I gained more knowledge and experience with Python. There were many coding assessments which covered abundant aspects of Data manipulation, the Python library - pandas(manipulating and merging Data Frames), relational databases in SQL, Intro to Shell and Conda essentials.

Certificate


Fundamentals of Tableau / DataCamp

This skill track helped me familiarise myself with one of the most popular BI tools, namely - Tableau. I learned how to organize and analyze data, create presentation-ready visualizations, build insightful dashboards, and apply analytics to worksheets. Moreover, I used data connectors to combine and prepare datasets and combined multiple data tables with various relationships, joins and unions. Furthermore, I learned how to manage different data properties, like renaming data fields, assigning aliases, changing data types, etc.

Certificate


Data Analyst with Python / DataCamp

This career path successfully enriched my insight into Deep Learning. The track contained more intermediate courses on SQL and Python libraries such as Matplotlib and Seaborn, where I covered a wide variety of displaying the data(making subplots, overlaying plots, as well as strip, swarm and violin plots). Furthermore, there were two courses for Statistical Thinking in Python, where I cleared the previously acquired data to make an accurate conclusion for the tendencies.

Certificate


Data Scientist with Python / DataCamp

Finishing the Data Scientist career path, I successfully upgraded my skills from the last two tracks(Data Analyst and Python Programmer) thanks to the project and the case study I solved.

Certificate


Data Engineer with Python / DataCamp

Completing the Data Engineer track, I deepened my expertise in building datasets from imported files with different file formats and Bash scripting. Moreover, I learned how to process data in Shell and did command-line automation.
I integrated Amazon Web Services (AWS) into data workflow, uploading data to Boto Amazon Simple Storage Service (S3), creating buckets, and subscribing people to SNS to receive critical notifications via SMS. Boto is an AWS software development kit for Python that enables creating, configuring and managing AWS services. Amazon Simple Notification Service helps manage services that deliver messages from producers to consumers.
Another topic was Big data analysis with Apache Spark. Spark helps you interface with RDDs. PySparkSQL is a library to apply SQL analysis, while the MLlib machine learning library is a wrapper over PySpark.
Another discovered area was error handling, transactions in SQL Server, and building and optimising triggers in SQL Server.

Certificate


Machine Learning Scientist with Python / DataCamp

This track gave me significant knowledge about Keras, a Python library capable of running on top of TensorFlow.
There were exercises about building Sequential models with Dense layers(different numbers of neurons and layers). Also, I used Keras for image processing and classifying clothing types. XGBoost is a fast and scalable gradient-boosting library.
I learned how to measure AUC(Area under the curve) and accuracy, import hyperparameters with a Pipeline and many other valuable usages.
Another important Machine Learning tool included in the path was Apache Spark, integrated with Python.

Certificate


Projects

The Impact of Location and Price on Restaurant Ratings

January 2023

Scientific paper

This project aims to investigate the influence of location and price on restaurant ratings, addressing the lack of research in this area. By utilizing advanced statistical techniques, the project seeks to fill this gap and provide valuable insights for the restaurant industry.

Methodology:

  • Cluster Analysis: Employing cluster analysis, the study categorizes restaurants based on their proximity to city centers. This analysis uncovers patterns and clusters within the data, revealing how city center proximity plays a role in influencing restaurant ratings.
  • Multiple Linear Regression: The project utilizes multiple linear regression to assess the main and moderating effects of location and price on restaurant ratings. This technique helps establish relationships between these factors and their impact on ratings.
Findings:
  • Cluster Insights: The analysis uncovers four distinct clusters of restaurants based on their city center proximity. This categorization provides valuable insights into how geographical location influences restaurant ratings.
  • Significant Impact: The research demonstrates that the identified clusters significantly affect restaurant ratings. The project's findings shed light on the previously unexplored relationship between location and ratings.
  • Price Moderation: The study reveals that price has a moderating effect on the relationship between location and restaurant ratings. Specifically, the effect of location on ratings varies depending on the pricing strategy adopted by restaurants.
Conclusion:
By examining the interplay between location, price, and restaurant ratings, this project contributes essential insights for the restaurant industry. The project's findings suggest that restaurants in city areas with high competitor concentrations tend to receive higher ratings. Additionally, the moderation effect of price on restaurant ratings highlights the importance of carefully considering pricing strategies. The knowledge gained from this project offers valuable guidance to restaurant owners and managers seeking to optimize their ratings and overall success.


A Comparative Analysis Of Convolutional Neural Networks Performance Using Class-Weights Versus Applying Oversampling On Highly Imbalanced Image Data Sets

April 2022

Scientific paper

This project investigates strategies for managing imbalanced image datasets in Convolutional Neural Networks (CNNs) by comparing two techniques: oversampling and class weighting. The focus is on a dataset containing images of whales and dolphins, which exhibits a significant class imbalance.

Objectives:

  • Compare the effectiveness of oversampling and class weighting methods for imbalanced data in CNNs.
  • Evaluate the performance of three residual convolutional models with variations in end layers, depth, and width.
Methodology:
  • Employ three residual convolutional models incorporating modern architectural features such as residual connections, global average pooling, and swish activation.
  • Implement oversampling to duplicate instances from minority classes and utilize class weighting to assign appropriate importance to each class during training.
Results:
  • The experiments demonstrate that oversampling leads to higher accuracy in the classification task.
  • However, oversampling also introduces a higher false-positive rate for classes with limited instances.
  • The choice between oversampling and class weighting depends on striking a balance between accuracy and true-positive rate, particularly in real-world production scenarios.
Conclusion:
This project provides insights into handling imbalanced image datasets for CNNs. By comparing oversampling and class weighting techniques, it highlights the trade-offs between accuracy and false-positive rates. The findings can guide the selection of an appropriate approach based on the specific needs of the task and the production environment.


Web app - BooksExchange

February 2018 - July 2018

A platform inspired by the BookCrossing initiative and the trends of a shared economy.
The main idea is to give people the possibility to connect with others that have a particular book in a given region and exchange one book for another without actually putting any monetary value in the exchanged books. This way, small communities can be created within cities/regions/counties or even within a company/school or university.
The biggest goal was to make the most major open library that enables books to travel worldwide.
For the front-end we used React.js with the material design MDL Lite library for styling the interface.
For the back-end we used ASP.NET Core with EF Core and SQL Server.

Demo website - The demo website currently has a lot of placeholder info and media until the actual "content" is poured in.


Web app - MadScience

April 2018 - October 2018

We created the website platform to show our development skills. The main idea was to find diverse and exciting projects. We were motivated and open to new challenges. That was the reason for creating the company which offered such types of services. It provided users to specify and customise their needs for a web platform.

Demo website - The demo website currently has a lot of placeholder info and media until the actual "content" is poured in.


Arduino based self watering planter

Arduino based self watering planter
November 2018 - February 2019

We were motivated to take care of our plants and wanted to give them precisely calculated water and sunlight, without which they could not survive. Some of the species are very demanding and difficult to grow. So we decided to make our custom garden project to monitor them.
We used Arduino to receive and display the values with different components.
For the database, we used Apache Cassandra; for displaying the charts, we decided to use React-Vis, and Python for data ingestion from Arduino through USB (Future versions through LoRaWAN), mapping the input and updating Apache Cassandra and GoLang - REST/GraphQL (not yet decided and implemented) from the Apache Cassandra.

GitHub project

Certificates

Introduction to Power BI - DataCamp
Dec 2020 – Present

Certificate


Machine Learning - Coursera
Stanford University with Andrew Ng
Oct 2018 – Present

Certificate


Google Digital Garage
Sept 2017 – Present

Certificate


Google Analytics Academy - Google Analytics for Beginners
30 May 2018 – 30 May 2020

Certificate


Certificates from Software University

JavaScript for Front-End
Oct 2017 – Present

Certificate


Programming Basics with C++
Sep 2017 – Present

Certificate


Web Fundamentals (HTML/CSS)
Sep 2017 – Present

Certificate


Programming Fundamentals
May 2017 – Present

Certificate


Programming Basics with C#
May 2017 – Present

Certificate


Certificates from Corporate Finance Institute

Accounting Fundamentals
Jul 2019 – Present

Certificate


Reading Financial Statements
Jul 2019 – Present

Certificate


Fixed Income Fundamentals
Jul 2019 – Present

Certificate


Excel Crash Course
Aug 2019 – Present

Certificate


Introduction to Corporate Finance
Aug 2019 – Present

Certificate


Certificates from Udemy

Introduction to AI for Business
Nov 2018 – Present

Certificate


Kickstart Artificial Intelligence
Nov 2018 – Present

Certificate


Certificates from Cognitive Class

Statistics 101
Feb 2019 – Present

Certificate


Machine Learning with Python
Feb 2019 – Present

Certificate


Data Visualization with Python
Feb 2019 – Present

Certificate


Data Analysis with Python
Feb 2019 – Present

Certificate


Certificates from IBM

Statistics 101
Feb 2019 – Present

Certificate


Machine Learning with Python - Level 1
Feb 2019 – Present

Certificate


Applied Data Science with Python - Level 2
Feb 2019 – Present

Certificate


Volunteering

Microsoft
Nov 2017 - Feb 2018

I partially translated Visual Studio Code from English to Bulgarian.


Volunteer Assistant - CoderDojo Bulgaria
Jun 2018

I volunteered in Mega Dojo Sofia 2.0, part of the closing of the Bulgarian Presidency of the Council of the European Union.