Data Scientist
We're building the smartest platform in Web3 marketing - and we’re looking for a highly skilled Data Scientist to join us.
We are seeking a highly skilled Data Scientist for Marketing Campaign Analysis.
The ideal candidate will be responsible for aggregating data from text, sound, and video sources from platforms like Twitter/X, TikTok, YouTube and similar sites.
This involves utilizing Speech-to-Text (STT) and video transcription tools to convert audio and video data into text format, applying Natural Language Processing (NLP) techniques to preprocess and clean the aggregated data, and employing Deep Learning (DL) models for feature extraction and data enhancement.
The candidate should develop and fine-tune Encoder-Decoder Large Language Models (LLMs) for stance detection tasks using datasets like SemEval and RumourEval. Additionally, they will implement Sentiment Analysis using Bidirectional Encoder Representations from Transformers (BERT) or similar models to assess public sentiment towards brands. Named Entity Recognition (NER) will be used to track specific entities and influencers.
The role also involves using NER to identify and categorize entities such as brands, influencers, and products across various social media platforms. The candidate will apply Influencer Analysis techniques to evaluate the impact and relevance of influencers using NLP and Machine Learning (ML). Audience Segmentation will be conducted using clustering algorithms like K-means or DBSCAN to align influencer audiences with target markets.
Furthermore, the candidate will identify and integrate external data sources (e.g., APIs, web scraping) to enhance campaign and influencer assessments. They will utilize web scraping techniques to collect additional data from relevant websites and forums.
In terms of predictive modeling, the candidate will develop Predictive Models using Regression, Time Series Analysis, and Machine Learning techniques (e.g., Random Forest, SVM) to forecast campaign outcomes. They will employ ARIMA, Prophet, or LSTM models for time series forecasting of campaign metrics and use tools like Google Analytics or Adobe Analytics for campaign performance evaluation.