NebulaClawNebulaClaw

Moderation Rules

Configure AI-powered auto-moderation rules

NebulaClaw's moderation rules use natural language โ€” describe the behavior you want to moderate, and the AI handles enforcement automatically.

Creating a Rule

Open Rules Page

Navigate to Moderation > Rules from the sidebar.

Add a Rule

Click Add Rule and configure:

  • Description โ€” Describe the behavior to moderate in plain language
  • Action โ€” What happens when the rule is triggered
  • Duration โ€” For mute actions, how long the restriction lasts
  • Enabled โ€” Toggle the rule on or off

Save

Click Add Rule to save. The rule takes effect immediately.

This set covers the six most common violation patterns seen in Web3 / cross-border communities. Copy-paste ready โ€” just add them one by one.

#Description (paste into the rule description field)ActionDuration
1Insults, personal attacks, sustained provocation, or defamation โ€” including against the project, team, or community memberskickโ€”
2Scams or phishing messageskickโ€”
3Requesting or disclosing private keys / seed phrasesbanPermanent
4Spam or advertisingmute24 hours
5Malicious linksmute1 hour
6Fake airdrops or referral-farming schemesmute24 hours

Severity Rationale

  • Ban (permanent) โ€” irreversible harm. Leaking private keys / seed phrases inflicts direct reputation and asset damage.
  • Kick โ€” high-hostility behavior (abuse / scams). Allows a "one-out" reset; rejoining via invite signals willingness to reform.
  • Mute 24h โ€” recurring violations (spam / fake airdrops). A cool-down window that prevents atmosphere decay.
  • Mute 1h โ€” suspicious-but-observational (malicious links). Short cooldown minimizes collateral on real users.
Treat this as a starting point. After week one, review the Audit Log and feed false positives / false negatives back into rule descriptions โ€” precision scales with clarity.

Writing Good Descriptions

Good examples (specific scenario + key qualifiers):

  • "Users posting promotional links or invite links other than Telegram / X"
  • "Messages pushing 'guaranteed returns', 'official cashback', or 'airdrop claim' scams"
  • "Offensive language or personal attacks against other members"
  • "Sharing NSFW content or inappropriate images"

Avoid vague descriptions:

  • โŒ "Bad behavior" (too broad)
  • โŒ "Spam" (too vague โ€” what counts as spam?)
  • โŒ "Violation" (the AI can't tell which violation)

Actions

ActionEffectReversible?
MuteUser cannot send messages for a set durationYes โ€” auto-expires or can be undone
KickUser is removed from the groupPartial โ€” user can rejoin via invite link
BanUser is permanently removedNo โ€” must be manually unbanned

Mute Duration

For mute actions, set a duration in minutes or select Permanent for indefinite muting.

Managing Rules

  • Toggle on/off โ€” Disable a rule without deleting it using the switch
  • Edit โ€” Click the edit icon to modify description, action, or duration
  • Delete โ€” Click the trash icon to permanently remove a rule
Start with lenient rules (short mutes) and tighten over time. Review the Audit Log regularly to check for false positives.

On this page