Moderation Rules
Configure AI-powered auto-moderation rules
NebulaClaw's moderation rules use natural language โ describe the behavior you want to moderate, and the AI handles enforcement automatically.
Creating a Rule
Open Rules Page
Navigate to Moderation > Rules from the sidebar.
Add a Rule
Click Add Rule and configure:
- Description โ Describe the behavior to moderate in plain language
- Action โ What happens when the rule is triggered
- Duration โ For mute actions, how long the restriction lasts
- Enabled โ Toggle the rule on or off
Save
Click Add Rule to save. The rule takes effect immediately.
Recommended Baseline Ruleset
This set covers the six most common violation patterns seen in Web3 / cross-border communities. Copy-paste ready โ just add them one by one.
| # | Description (paste into the rule description field) | Action | Duration |
|---|---|---|---|
| 1 | Insults, personal attacks, sustained provocation, or defamation โ including against the project, team, or community members | kick | โ |
| 2 | Scams or phishing messages | kick | โ |
| 3 | Requesting or disclosing private keys / seed phrases | ban | Permanent |
| 4 | Spam or advertising | mute | 24 hours |
| 5 | Malicious links | mute | 1 hour |
| 6 | Fake airdrops or referral-farming schemes | mute | 24 hours |
Severity Rationale
- Ban (permanent) โ irreversible harm. Leaking private keys / seed phrases inflicts direct reputation and asset damage.
- Kick โ high-hostility behavior (abuse / scams). Allows a "one-out" reset; rejoining via invite signals willingness to reform.
- Mute 24h โ recurring violations (spam / fake airdrops). A cool-down window that prevents atmosphere decay.
- Mute 1h โ suspicious-but-observational (malicious links). Short cooldown minimizes collateral on real users.
Treat this as a starting point. After week one, review the Audit Log and feed false positives / false negatives back into rule descriptions โ precision scales with clarity.
Writing Good Descriptions
Good examples (specific scenario + key qualifiers):
- "Users posting promotional links or invite links other than Telegram / X"
- "Messages pushing 'guaranteed returns', 'official cashback', or 'airdrop claim' scams"
- "Offensive language or personal attacks against other members"
- "Sharing NSFW content or inappropriate images"
Avoid vague descriptions:
- โ "Bad behavior" (too broad)
- โ "Spam" (too vague โ what counts as spam?)
- โ "Violation" (the AI can't tell which violation)
Actions
| Action | Effect | Reversible? |
|---|---|---|
| Mute | User cannot send messages for a set duration | Yes โ auto-expires or can be undone |
| Kick | User is removed from the group | Partial โ user can rejoin via invite link |
| Ban | User is permanently removed | No โ must be manually unbanned |
Mute Duration
For mute actions, set a duration in minutes or select Permanent for indefinite muting.
Managing Rules
- Toggle on/off โ Disable a rule without deleting it using the switch
- Edit โ Click the edit icon to modify description, action, or duration
- Delete โ Click the trash icon to permanently remove a rule
Start with lenient rules (short mutes) and tighten over time. Review the Audit Log regularly to check for false positives.

