Social Sentiment Worker: Initial Implementation #48
Summary
This PR introduces a complete social sentiment analysis worker, including all supporting scripts, configuration, and documentation, under the ai/ directory. The worker fetches tweets, processes them for token mentions, analyzes sentiment, and stores results in ClickHouse, enabling downstream analytics and reporting.
Key Features
Social Sentiment Core Worker (social_sentiment_core_worker.py):
Configuration & Data Files:
social_sentiment_worker_kol_list.csv: List of authors (KOLs) to monitor.
social_sentiment_worker_tokens_list.csv: List of tokens/symbols/fullnames to detect in tweets.
social_sentiment_pipeline.png: Visual overview of the sentiment analysis pipeline.
social_sentiment_core_worker.md: Documentation and usage instructions.
Settings Integration:
How It Works
Fetch Raw Tweets:
The worker fetches recent tweets for each author in the KOL list, storing new tweets in ClickHouse.
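The fetch step can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the `author` column name, the `fetch_recent` callable, and the tweet dictionary shape (`id`, `text`) are all assumptions, and the ClickHouse lookup is replaced by a plain set of already-stored tweet IDs.

```python
import csv
import io
from typing import Callable, Iterable


def load_authors(csv_text: str, column: str = "author") -> list[str]:
    """Read author handles from CSV text. The column name is hypothetical."""
    reader = csv.DictReader(io.StringIO(csv_text))
    # Missing or blank cells are skipped rather than treated as authors.
    return [(row.get(column) or "").strip()
            for row in reader
            if (row.get(column) or "").strip()]


def collect_new_tweets(
    authors: Iterable[str],
    fetch_recent: Callable[[str], list[dict]],  # hypothetical API wrapper
    known_ids: Iterable[str],                   # IDs already stored
) -> list[dict]:
    """Return only tweets whose ID has not been seen before."""
    seen = set(known_ids)
    new_tweets = []
    for author in authors:
        for tweet in fetch_recent(author):
            if tweet["id"] not in seen:
                seen.add(tweet["id"])
                new_tweets.append(tweet)
    return new_tweets
```

Passing the fetch function in as a parameter keeps the deduplication logic testable without network access or a database connection.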
Process Sentiment:
For each unprocessed tweet, the worker:
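The summary mentions that tweets are processed for token mentions. One way that step might look is sketched below, assuming the tokens list maps each symbol to a full name. The function names and the matching rules (case-insensitive, whole-word, optional leading `$`) are illustrative assumptions and not necessarily what the worker does.

```python
import re


def build_token_pattern(names) -> re.Pattern:
    """Compile one regex matching any name as a whole word."""
    # Longest alternatives first so a full name wins over a shorter symbol.
    alternatives = sorted((re.escape(n) for n in names), key=len, reverse=True)
    return re.compile(
        r"(?<!\w)\$?(" + "|".join(alternatives) + r")(?!\w)",
        re.IGNORECASE,
    )


def find_mentions(text: str, tokens: dict[str, str]) -> list[str]:
    """Return the sorted symbols mentioned in text.

    tokens maps symbol -> full name (a hypothetical shape for the CSV rows).
    """
    if not tokens:
        return []
    # Both the symbol and the full name resolve to the symbol.
    lookup = {}
    for symbol, fullname in tokens.items():
        lookup[symbol.lower()] = symbol
        lookup[fullname.lower()] = symbol
    pattern = build_token_pattern(lookup.keys())
    return sorted({lookup[m.group(1).lower()] for m in pattern.finditer(text)})
```

Case-insensitive matching on short symbols can produce false positives for symbols that are also ordinary words, so a real implementation may want stricter rules for those.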
Orchestration:
The worker is designed to be triggered and managed via Camunda BPM, supporting external task handling.
How to Test
Create a .env file with the required ClickHouse, RapidAPI, and MindsDB credentials.
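Before running the worker, it can help to confirm that the credentials are actually present in the environment. The check below is only a sketch: the variable names are hypothetical placeholders, since this description does not list the real keys, so substitute the names the worker's settings actually read.

```python
import os

# Hypothetical variable names; replace with the keys the worker expects.
REQUIRED_VARS = (
    "CLICKHOUSE_HOST",
    "CLICKHOUSE_PASSWORD",
    "RAPIDAPI_KEY",
    "MINDSDB_API_KEY",
)


def missing_settings(env=None, required=REQUIRED_VARS) -> list[str]:
    """Return the required variable names that are unset or empty."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name)]


if __name__ == "__main__":
    missing = missing_settings()
    if missing:
        raise SystemExit("Missing settings: " + ", ".join(missing))
    print("All required settings are present.")
```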