Edit filter configuration

Differences between versions

Old version: 20:35, 12 February 2022 by Suffusion of Yellow
New version: 23:55, 14 February 2022 by Suffusion of Yellow
Basic information
Notes:
Creating as requested by Sole Soul, log only for now. - Someguy1221

Hasn't been tested sufficiently for false positives. Change to warn and tag. - KoH

Adding "wuss rock". --Reaper
Put \b around cunt, remove suck (we already have a filter for that). - KoH

Change back to disallow. Run for over 9 days, 1683 hits, no false positives. - Someguy1221

Use same quote exemption as filter 9. - KoH


As a test, remove the "edit_delta > 0" condition. Meanwhile, the filter is tag only. -Sole Soul
Change to disallow. -Sole Soul

Optimize. -Sole Soul
Add some bad words, remove others with no hits. Will monitor log for any FPs. -Sole Soul
Removal of the word "balls" and renaming the filter. -Sole Soul

No need to be private. Sole Soul

Fix to avoid false positive of "Teledildonics". Od Mishehu

rm "hello" - causing FPs in music-related pages. Also not a bad word. --Reaper 2012-05-05

Use single string. - KoH

Wrap both sides of "was here". - KoH

Remove mild or ambiguous terms. There is a reason why we have filter 11 for "suck." - KoH

Hiding due to potential for abuse.

Format and reduce condition usage. -DF

Delimit "slut" to allow "Slutsky", at least for now. RF 2015-06-19

Why is there the line
! removed_lines irlike "\w\*\*?\w" ?

Optimize - we can reduce false positives in future. RF 20150817
Remove false positives due to disambig pages with bad words in their name. Note this test is only carried out if it would otherwise be considered vandalism, and so is rare. RF 20150819


Include the spelling "paedophile" in the words to be caught. --Od Mishehu
Remove "awesome" for now. RF 2015-10-29
! added_lines rlike "\bDick\b\s[A-Z][a-z]" & ...
Should exclude FPs for people called Dick Foo. RF 20151101


Merging in one rule from 285 so it can be disabled. Filter does not need to be private as a general-purpose vandalism filter, methinks. Also increasing edit delta and removing max length of added_lines, those wouldn't seem to help false positives that much. Finally, removing the now-unneeded short-circuiting that was fixed in MediaWiki 1.28. I'm keeping a close eye on the logs ~MA 2016.09.11

Removing "hi", causing false positives and is already covered by Special:AbuseFilter/664 ~MA 2017.01.01

Adding cuck. Tested at filter 839, looks good. --Someguy1221

Allow people called Foo Dick as well as Dick Foo. -Cyp 2017-10-09

Add summary tests as well.-CYBERPOWER

Adding an exception for edit summaries containing a bad word, for if the article already contained a bad word, and the summary looks to be autogenerated from a section edit. Without it, this triggers false positives when new users and IPs edit sections whose title contains a bad word. Happens with some music-related pages. --Someguy1221

Excluding "It was here" from the "____ was here" test. --Someguy1221

Cleanup, whitelist {{sortname|Dick|Tracy}}, etc. ~MA

Change "\bcrap" to "\bcrap(?:py)?\b" per [[Special:PermaLink/879717525#24.243.2.93]] -Galo 2019.01.23

Per [[Special:PermaLink/881084124#194.69.14.123]] no need to check page_title for just disambiguation pages -Galo 2019.01.31

Remove edit summary portion of the filter per [[Special:PermaLink/882767769#149.135.11.157]] (needs to be moved to another filter so that false positives can be reduced). -Galo 2019.02.11


Allow words starting with "blah", see [[Wikipedia:Edit_filter/False_positives/Archive_94#2A01:110F:4101:1900:A075:F1A7:6AF3:5409]] --SoY 7 Mar 2019


Ignore cases where links have bad words per FPs. -G 2019-04-11

Allow "Dick and Jane", "Tom, Dick, and Harry", etc. --Suffusion of Yellow 00:43 13 Apr 2019

Allow "Peniston", not just "Penistone". Suggested by DannyS712, see [[Special:PermaLink/902284907#Tiptoeslightly]] --Suffusion of Yellow 20:27 17 Jun 2019

Merge in some words from filter 839; see that filter's notes for attribution and links to discussions. --Suffusion of Yellow 22:23 4 Dec 2019


Rm useless negative lookahead assertion; was trying to match the (s?|sucker) after "cock". --Suffusion of Yellow 22:37 4 Dec 2019


Use non-capturing groups to prevent similar errors in the future. --Suffusion of Yellow 22:42 4 Dec 2019

Rm triple-big; no matches in 5000 hits. --Suffusion of Yellow 22:53 4 Dec 2019

Also copy "dumb\s*ass" from filter 839; still undecided about plain "ass". --Suffusion of Yellow 23:07 4 Dec 2019

Whitelist "Foo & Dick", etc. --Suffusion of Yellow 18:49 20 Dec 2019

+fatass and variants --Suffusion of Yellow 21:31 24 Jan 2021
+more variants --Suffusion of Yellow 01:01 29 Jan 2021
+a$$ variants too --Suffusion of Yellow 02:05 30 Jan 2021

Allow last name "Dicks" --Suffusion of Yellow 20:42 16 Feb 2021

Also check for accented Dick Names. --Suffusion of Yellow 18:36 5 Mar 2021

+shithole, tested in 839 --Suffusion of Yellow 21:37 15 Jan 2022

Add more citation template parameters to Dick Name check; change suggested by Tamzin. --Suffusion of Yellow 23:42 11 Feb 2022

Allow "Penisi" per FPs. --Suffusion of Yellow 20:35 12 Feb 2022
Add a newline between page_title and added_links, to avoid breaking word-boundary checks. --Suffusion of Yellow 23:55 14 Feb 2022
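
The newline note above can be illustrated with a small sketch. It uses Python's `re` module as a stand-in for the filter's `irlike` operator and only the "dick" alternative of `bad_word`. The page title, the link URL, and the helper name `title_or_links_exempt` are hypothetical, and it assumes `added_links` turns into plain URL text when concatenated.

```python
import re

# Only the "dick" alternative of bad_word, copied from the filter conditions.
BAD_WORD = re.compile(r"\bdick(?:sucker|head)?s?\b", re.IGNORECASE)


def title_or_links_exempt(page_title: str, added_links: str, sep: str) -> bool:
    """Return True when the title/links text itself contains the bad word,
    which is the case the filter's last condition is meant to exempt."""
    return BAD_WORD.search(page_title + sep + added_links) is not None


# Hypothetical edit: a page titled "Dick" gains one external link.
title = "Dick"
links = "https://example.org/"

# Without a separator the text is "Dickhttps://example.org/", so the
# trailing \b after "Dick" has no word boundary to match.
old_result = title_or_links_exempt(title, links, sep="")

# With "\n" in between, the title ends at a word boundary again.
new_result = title_or_links_exempt(title, links, sep="\n")

print(old_result, new_result)
```

In this sketch the old concatenation misses the bad word in the title, so the exemption would not apply and the edit could be flagged; the newline restores the match.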
Filter conditions
Conditions:
edit_delta < 250 &
!("confirmed" in user_groups) &
page_namespace == 0 & (
     dick_name := "\bDick('s\s\p{L}\p{Ll}|(,?\s(and|&))?\s\p{Lu}[\p{Ll}.])|\b(\p{Lu}(\p{Ll}+|\.)?(,?\s(and|&))?|[DM]r\.)\sDicks?\b|\b(first|last|author|given|surname)(\d*(-?link)?|-?link\d*)?\s*=\s*Dicks?\b|{{\s*[Ss]ortname\s*\|\s*Dicks?\s*\|";

     !(summary irlike "und(id|o)|\brv[vt]?\b|revert") & (
         bad_word := "\b(?:ass|butt)\s?holes?\b(?!\Wsurfer)|\b(?:dumb|(?:f|ph)at+(?:est)?)\s*a(?:ss|\$\$)\b|bitch(?!\"? magazine)|\bcock(?:s?|sucker)\b|\bdick(?:sucker|head)?s?\b|cuck(?:|s|er|ing)\b|\bcunts?\b|\bdildo|douche|f(?:u|v)ck|nigg(?:er|ah*)|p(?:a|e|ae)do(?:f|ph)il|fag(?:g[aio]t)?\b|penis(?!ton|i)|\bcrap(?:py)?\b|\bslut\b|whore|\b(?:bull|dog|horse)?shit(?:hole|s|ti?er|t?y|t?ing)?\b|\bblah\b|\bpuss(?!ycat|y Riot)a?y|\bidiot\b|(?<!\bit )(?<!\bthat )\bwas here\b|\blol\b|(?:[asdfghjkl;]\s*){30}|\!{10,}";

         added_lines irlike bad_word &
         !(added_lines rlike dick_name) &
         !(removed_lines irlike (bad_word + "|\w\*{1,4}\w")) &
-        !((page_title + added_links) irlike bad_word)
+        !((page_title + "\n" + added_links) irlike bad_word)
     )
)
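
Several alternatives in `bad_word` rely on word boundaries and lookarounds to skip false positives recorded in the notes ("Peniston", "Penisi", "Slutsky", "It was here"). The sketch below checks four of them in isolation with Python's `re`, used here as a stand-in for `irlike`. The helper name `hits` and the sample word "crappie" are illustrative assumptions, not taken from the filter or its logs.

```python
import re

# Four alternatives copied from bad_word; each is tested on its own.
PATTERNS = {
    "penis": r"penis(?!ton|i)",
    "crap": r"\bcrap(?:py)?\b",
    "slut": r"\bslut\b",
    "was here": r"(?<!\bit )(?<!\bthat )\bwas here\b",
}


def hits(name: str, text: str) -> bool:
    """Case-insensitive search, mirroring how irlike is used in the filter."""
    return re.search(PATTERNS[name], text, re.IGNORECASE) is not None


# Negative lookahead: surnames and place names from the notes are skipped.
print(hits("penis", "Peniston"), hits("penis", "Penisi"), hits("penis", "penis"))

# Trailing \b: "crappy" matches, a longer word such as "crappie" does not.
print(hits("crap", "crappy"), hits("crap", "crappie"))

# \b on both sides lets "Slutsky" through.
print(hits("slut", "Slutsky"), hits("slut", "slut"))

# Negative lookbehinds exclude "It was here" and "that was here".
print(hits("was here", "It was here that he died"), hits("was here", "Bob was here"))
```

Testing each alternative separately keeps the effect of a single lookaround visible; in the real filter they are combined into one alternation, so a text that slips past one alternative can still match another.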