> ## Documentation Index
> Fetch the complete documentation index at: https://dify-6c0370d8-docs-new-agent-experience.mintlify.site/llms.txt
> Use this file to discover all available pages before exploring further.

# Twitter Account Analyzer

> Build a chatflow that scrapes a Twitter profile via Crawlbase and analyzes the user's tweets with an LLM

## Introduction

In Dify, you can use some crawler tools, such as Jina, which can convert web pages into markdown format that LLMs can read.

Recently, [wordware.ai](https://www.wordware.ai/) has brought to our attention that we can use crawlers to scrape social media for LLM analysis, creating more interesting applications.

However, knowing that X (formerly Twitter) stopped providing free API access on February 2, 2023, and has since upgraded its anti-crawling measures. Tools like Jina are unable to access X's content directly.

> Starting February 9, we will no longer support free access to the Twitter API, both v2 and v1.1. A paid basic tier will be available instead 🧵
>
> — Developers (@XDevelopers) [February 2, 2023](https://twitter.com/XDevelopers/status/1621026986784337922?ref_src=twsrc%5Etfw)

Fortunately, Dify also has an HTTP tool, which allows us to call external crawling tools by sending HTTP requests. Let's get started!

## Prerequisites

### Register Crawlbase

Crawlbase is an all-in-one data crawling and scraping platform designed for businesses and developers. Crawlbase Scraper can pull data from social platforms like X, Facebook, and Instagram.

Register at [crawlbase.com](https://crawlbase.com).

### Sign in to Dify

Open Dify in your browser and sign in. You'll need access to a running Dify instance to follow along.

### Configure LLM Providers

Go to **Integrations** > **Model Provider**, install at least one model provider (for example, OpenAI), and configure its credentials.

<Frame>
  ![Configure Model Provider in Account Setting](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/4b4102f9027e2bda3fc520eaa8ea2354.png)
</Frame>

## Create a chatflow

Now, let's get started on the chatflow.

Click on `Create from Blank` to start:

<Frame>
  ![Create from Blank to Start](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/b2955735f5c122d8a2fc08ef13654239.png)
</Frame>

The initialized chatflow should be like:

<Frame>
  ![](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/baee341b771d1cd77780fd4845b467b2.png)
</Frame>

## Add nodes to chatflow

<Frame>
  ![The Final Chatflow Looks Like This](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/bad3185d9f2c92994c24de65a5414182.png)
</Frame>

### Start node

In start node, we can add some system variables at the beginning of a chat. In this article, we need a Twitter user's ID as a string variable. Let's name it `id`.

Click on Start node and add a new variable:

<Frame>
  ![Start Node and Add a New Variable](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/a041be2230364d7e729527f3f7af34d8.png)
</Frame>

### Code node

According to [Crawlbase docs](https://crawlbase.com/docs/crawling-api/scrapers/#twitter-profile), the variable `url` (this will be used in the following node) should be `https://twitter.com/` + `user id`, such as `https://twitter.com/elonmusk` for Elon Musk.

To convert the user ID into a complete URL, we can use the following Python code to integrate the prefix `https://twitter.com/` with the user ID:

```python theme={null}
def main(id: str) -> dict:
    return {
        "url": "https://twitter.com/"+id,
    }
```

Add a code node and select python, and set input and output variable names:

<Frame>
  ![Add a Code Node and Select Python, and Set Input and Output Variable Names](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/e5523ba1f801f4009b74e7cf03e2ef2f.png)
</Frame>

### HTTP request node

Based on the [Crawlbase docs](https://crawlbase.com/docs/crawling-api/scrapers/#twitter-profile), to scrape a Twitter user's profile in http format, we need to complete HTTP request node in the following format:

<Frame>
  ![Based on the Crawlbase Docs, to Scrape a Twitter User's Profile in HTTP Format,](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/13899d88abeb3b3be20c44d40565a5f9.png)
</Frame>

Importantly, it is best not to directly enter the token value as plain text for security reasons, as this is not a good practice. Actually, in the latest version of Dify, we can set token values in **`Environment Variables`**. Click `env` - `Add Variable` to set the token value, so plain text will not appear in the node.

Check [https://crawlbase.com/dashboard/account/docs](https://crawlbase.com/dashboard/account/docs) for your crawlbase API Key.

<Frame>
  ![Check HTTPS](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/c99b66ac8d30289615a8869bae5a6455.png)
</Frame>

By typing `/`, you can easily insert the API Key as a variable.

<Frame>
  ![By Typing / , You Can Easily Insert the API Key as a Variable](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/51f9350677acb396bad5841fa80c903c.png)
</Frame>

Tap the start button of this node to check whether it works correctly:

<Frame>
  ![Tap the Start Button of This Node to Check Whether It Works Correctly](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/094b96e513169a47f1749e46e1357893.png)
</Frame>

### LLM node

Now, we can use LLM to analyze the result scraped by crawlbase and execute our command.

The value `context` should be `body` from HTTP Request node.

The following is a sample system prompt.

<Frame>
  ![](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/46f4e15ac1e9d3ca3f47dc5bb921ff01.png)
</Frame>

## Test run

Click `Preview` to start a test run and input twitter user id in `id`.

<Frame>
  ![Preview to Start a Test Run and Input Twitter User ID in ID](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/a25b122dfa14f0c65fcd3498ccf1898e.png)
</Frame>

For example, I want to analyze Elon Musk's tweets and write a tweet about global warming in his tone.

<Frame>
  ![](https://assets-docs.dify.ai/dify-enterprise-mintlify/en/workshop/intermediate/835a01082e74723138d9f97bee0c6c4b.png)
</Frame>

Does this sound like Elon? lol

Click `Publish` in the upper right corner and add it in your website.

Have fun!

## Lastly...

### Other X(Twitter) Crawlers

In this article, I've introduced crawlbase. It should be the cheapest Twitter crawler service available, but sometimes it cannot correctly scrape the content of user tweets.

The Twitter crawler service used by [wordware.ai](https://www.wordware.ai/) mentioned earlier is **Tweet Scraper V2**, but the subscription for the hosted platform **apify** is \$49 per month.

## Links

* [X@dify\_ai](https://x.com/dify_ai)
* Dify's repo on GitHub: [https://github.com/langgenius/dify](https://github.com/langgenius/dify)
