User talk:Cyberpower678/Archive 67
This is an archive of past discussions with User:Cyberpower678. Do not edit the contents of this page. If you wish to start a new discussion or revive an old one, please do so on the current talk page. |
Archive 60 | ← | Archive 65 | Archive 66 | Archive 67 | Archive 68 | Archive 69 | Archive 70 |
Minor issue in uk
Hello my friend, ukwiki sends you warmest greetings and big thanks for the InternetArchiveBot work
Remember that weird issue with tripling last letter in the name of the month of November in Ukrainian? Bot still does that: it puts "листопадааа" instead of "листопада". I remember looking with you at this issue at Wikimania; unsuccessfully. Can you please try to fix this? Thanks a lot :) -- Ата (talk) 18:53, 27 September 2019 (UTC)
- Ата, Can you give me a recent example to look at? Sorry for the late response, BTW.—CYBERPOWER (Message) 07:53, 3 October 2019 (UTC)
- Here: [1], [2] --Ата (talk) 14:27, 3 October 2019 (UTC)
- Alright. I'm going to use those pages to experiment. It probably won't be this weekend, but I'll look into it on Monday at the latest.—CYBERPOWER (Message) 07:17, 5 October 2019 (UTC)
- Here: [1], [2] --Ата (talk) 14:27, 3 October 2019 (UTC)
Hello, sorry if I disturb you in the midst of busy times. Could you help proceed with this OAbot task? Headbomb has helped trial the functionality but I understand he'd prefer not to handle the actual approval himself. Your experience would be appreciated in deciding what to do about the links to CiteSeerX in particular: they're supported by Help:Citation Style 1#Identifiers and WP:COPYLINKS appears to explicitly greenlight them with "It is currently acceptable to link to Internet archives" (CiteSeerX is an open repository which archives academic sources, makes them accessible via OAI-PMH and extracts structured citation data from unstructured PDFs). However, a couple users have concerns. Nemo 08:05, 2 October 2019 (UTC)
- Nemo bis, I'll take a look at it later today.—CYBERPOWER (Message) 07:19, 5 October 2019 (UTC)
InternetArchiveBot Shenanigans v.2
Followup on [3]: Your bot is still doing the same thing. Please stop or fix it asap. -- Meisam (talk) 10:34, 3 October 2019 (UTC)
- Meisam I have stopped it in the meantime.—CYBERPOWER (Message) 07:19, 5 October 2019 (UTC)
Disabled on nowp due to false positives
InternetArchiveBot was enabled on nowp today after being disabled a month ago. Today, I have reported dozens of false positives, and estimate that at least 20% of the bot edits are false positives. This has been reported on phabricator. - 4ing (talk) 17:42, 3 October 2019 (UTC)
- 4ing I very doubt the false positive rate is that high, but I will take a look into it when I get the next opportunity to. Thanks for the heads up.—CYBERPOWER (Message) 07:20, 5 October 2019 (UTC)
InternetArchiveBot on idwiki
Dear Cyberpower678. I am a fan of IABot made by you, and sometimes I use the tool to fix broken external links here. But, I wonder if the tool can be used in idwiki to fix the same thing. Usually the community found that a link is broken and don't have the idea how to fix that, or they know how to archive links but they have to change the same thing on hundreds articles.
Do you think idwiki is eligible to run such tool (IABot)? If so, what requirements do we need to comply? Thank you. ··· 🌸 Rachmat04 · ☕ 09:36, 4 October 2019 (UTC)
- A community discussion followed by an open request for approval would be the best first steps. :-)—CYBERPOWER (Message) 07:21, 5 October 2019 (UTC)
I have nominated someone for RFA
I have nominated User:Greenman for RFA. His nomination is located here: Wikipedia:Requests for adminship/Greenman. It has not shown up yet. LefcentrerightTalk (plz ping) 11:51, 5 October 2019 (UTC)
- Lefcentreright, You have to transclude it at WP:RFA. SQLQuery me! 19:34, 5 October 2019 (UTC)
New book not having book reports generated
I started the Book:State highways in Georgia (U.S. state). However, I have never seen any book reports generated. Is there some code to add that will get it generating? Morriswa (Charlotte Allison) (talk) 07:47, 7 October 2019 (UTC)
Archive date format
I hope it's the right place, IABot puts the dates in the wrong format. I don't understand what it's due to. --Emanuele676 (talk) 01:47, 8 October 2019 (UTC)
Hello, sorry if I disturb you in the midst of busy times. Could you help proceed with this OAbot task? Headbomb has helped trial the functionality but I understand he'd prefer not to handle the actual approval himself. Your experience would be appreciated in deciding what to do about the links to CiteSeerX in particular: they're supported by Help:Citation Style 1#Identifiers and WP:COPYLINKS appears to explicitly greenlight them with "It is currently acceptable to link to Internet archives" (CiteSeerX is an open repository which archives academic sources, makes them accessible via OAI-PMH and extracts structured citation data from unstructured PDFs). However, a couple users have concerns. Nemo 08:05, 2 October 2019 (UTC)
- Nemo bis, I'll take a look at it later today.—CYBERPOWER (Message) 07:19, 5 October 2019 (UTC)
- Thanks! Let me know if I can help figure it out. Nemo 14:00, 11 October 2019 (UTC)
Internet Archive and (quality of) bibliographic metadata
Hey, grabbing you based on your (otherwise unrelated) association with the Internet Archive…
Over the years, in my work here, I've kinda vaguely observed that the bibliographic metadata at the Internet Archive for scanned books is pretty poor. But over the last year or so I've been spending my time more over at Wikisource, and thus looked a bit closer at the issue; as well as watching the efforts at Wikidata related to books and bibliographic data. And my conclusion now has clarified to be: IA's bibliographic data is utter crap.
Just now I needed to look for scans of W. B. Yeats's works, and searching at IA it turns out they have these listed under "w. b. (william butler) yeats", "w. b. yeats", "w. b. (william butler) yeats", and "yeats, w. b. (william butler), 1856-1939", and probably a bunch of others hidden behind a "More" button. They have modern editions listed with year of publication set to the when the original was published. They have edition listed with the editor as the author, and with the author nowhere in the metadata. Publisher and place of publication is missing for most works. And let's not even get started on copyright status.
Meanwhile—between Wikisource, Wikidata, and Wikipedia—we're crowdsourcing a ton of this metadata in a structured or semi-structured format; more often than not compensating here for lack of good metadata there. And, like Worldcat, there seems to be no good way to feed our efforts back into their data.
This strikes me as… well, not to put too fine a point on it… blindingly stupid.
Can you think of any way to at least start addressing this? I don't particularly mean in terms of "Let's make a bot that scrapes… and then …"; but rather is there avenues or channels for discussing common interests and ways to take advantage of each others work? If I through Commons, Wikisource, or Wikidata (usually all three) have added a bunch of structured bibliographic data for a scan of a work, and have tagged the scan as being a particular IA identfier (i.e. it's the Source field over on Commons), what mechanisms might work for feeding that back into IA's metadata? That might be a bot that they control and so can trust that cherrypicks stuff they want on our side; or we could build some framework for them to trust us to provide that data directly. A formal cooperation—mediated by the WMF—that lets trusted users here edit metadata on their side directly? Maybe they could support adding a Wikidata ID from us, and a widget in their web interface that just shows information from Wikidata (rather than overwriting their own data)? There's got to be a bunch of different ways and models for working together, and plenty of common interest and mutual benefit to be had.
It's just… it drives me batty that they have three+ different free-text strings identifying the same author, and that everyone in the world will have to waste time on that, when I could have clicked a "merge authors" button and added William Butler Yeats (Q40213) and just fixed it for them.
To the degree you talk to anybody over there that might be interested in this kind of stuff, giving them a poke about it might be worthwhile. (I'm assuming the benefits to the Wikimedia side are obvious, so won't go into that). --Xover (talk) 11:11, 10 October 2019 (UTC)
- The Internet Archive generally imports MARC records from whichever library is participating, they don't correct upstream errors. Merges are supposed to happen on openlibrary.org as much as possible, not on archive.org itself. Nemo 14:02, 11 October 2019 (UTC)
Shift 2 Unleashed talk - 3 external link sections
There are 3 external links sections on the talk page, and they look like duplicates. Could this be reduced to one talk section? Rcgldr (talk) 22:07, 13 October 2019 (UTC)
Trouted
Whack! You've been whacked with a wet trout. Don't take this too seriously. Someone just wants to let you know that you did something silly. |
You have been trouted for: making a bot Kugihot ❯❯❯ Vanguard 15:22, 14 October 2019 (UTC)
Cyberbot appears not to fully understand Move protection
In the case I just noticed, the bot didn't concede that the unprotection request was satisfied because the page was still kept under *move* protection. But nobody had asked for anything to be done to the move protection, only the regular protection. The bot's action was mentioned at RFPP. One admin urged the bot to 'think out of the box.' Though I confess I have never looked up the instructions regarding Cyberbot. Are there any instructions? Does "Automated comment" mean that the bot is not satisfied and it will not archive the request until something more is done? In such cases do admins have to 'request immediate archiving'? Thanks for handling this thankless task. EdJohnston (talk) 17:12, 15 October 2019 (UTC)
Book talk missing title and missing end tag for italics, small, bold errors
You are messing up italics and titles in Book talk articles.
{{book report|Number 1's (Destiny's Child album)|''|GA|problems=|non-free=* [[:File:Destiny's Child – Number 1's.jpg]]
- causing a missing end tag for italics (
''
and title not displaying where needed; should be (and I fixed it to) {{book report|Number 1's (Destiny's Child album)|''Number 1's''|GA|problems=|non-free=* [[:File:Destiny's Child – Number 1's.jpg]]
{{book report|Number 1's (Destiny's Child album)|''|GA|chapter=Album|problems=|non-free=* [[:File:Destiny's Child – Number 1's.jpg]]
- causing a missing end tag for italics (
''
and title not displaying where needed; should be (and I fixed it to) {{book report|Number 1's (Destiny's Child album)|''Number 1's''|GA|chapter=Album|problems=|non-free=* [[:File:Destiny's Child – Number 1's.jpg]]
{{book report|''Stargate: The Ark of Truth|The Ark of Truth''|Unassessed|problems=* Page does not exist.
- causing a missing end tag for italics (
''
; should be (and I fixed it and you reverted and I fixed it again to) {{book report|Stargate: The Ark of Truth|''The Ark of Truth''|Unassessed|problems=* Page does not exist.
{{book report|''Stargate: Continuum|Continuum''|Unassessed|problems=|non-free=}}
- causing a missing end tag for italics (
''
; should be (and I fixed it and you reverted and I fixed it again to) {{book report|Stargate: Continuum|''Continuum''|Unassessed|problems=|non-free=}}
{{book report|''Stargate: Revolution|Revolution''|Unassessed|problems=|non-free=}}
- causing a missing end tag for italics (
''
; should be (and I fixed it and you reverted and I fixed it again to) {{book report|Stargate: Revolution|''Revolution''|Unassessed|problems=|non-free=}}
{{book report|Tak to chodí|''Tak to chodí'']] <small>by [[Michal Horáček (lyricist)|Start|chapter=Compilations|problems=|non-free=* [[:File:Tak to chodi front.jpg]]
- causing a missing end tag for
<small>
; should be (and I fixed it and you reverted) {{book report|Tak to chodí|''Tak to chodí'']] <small>by [[Michal Horáček (lyricist)</small>|Start|chapter=Compilations|problems=|non-free=* [[:File:Tak to chodi front.jpg]]
{{book report|''Wieland der Schmied (libretto)|Wieland der Schmied''|Unassessed|problems=* Page does not exist.
- causing a missing end tag for
<small>
; should be (and I fixed it and you reverted) {{book report|Wieland der Schmied (libretto)|''Wieland der Schmied''|Unassessed|problems=* Page does not exist.
{{WBOOKS|class=book}}{{book report start|<big>'''Wiki How To</big>'''|The basics of Wikipedia}}
- causing misnested tags; should be (and I fixed it)
{{WBOOKS|class=book}}{{book report start|<big>'''Wiki How To'''</big>|The basics of Wikipedia}}
And similar errors that I fixed that you haven't reverted yet in:
- Book talk:Korn
- Book talk:Anthrax
- Book talk:Miami Hurricanes
- Book talk:Stephen King
- Book talk:Brandy Norwood
- Book talk:William Holden
- Book talk:Video game series
- Book talk:Nicktoons
- Book talk:Resident Evil series
- Book talk:Military conflicts of the Three Kingdoms era
- Book talk:Boeing Passenger Jets
- Book talk:Arkansas Confederate Infantry Units
- Book talk:Christianity
- Book talk:Diabetes mellitus type 1
And similar errors that I haven't fixed at Lint errors: Missing end tag in the Book talk namespace. Please deal with these errors, or at least point how to generate the list of "source code" pages and how to edit them to make the problem go away. —Anomalocaris (talk) 23:32, 15 October 2019 (UTC)
IABot on idwiki — Follow up
Hi Cyberpower678!
Following our last discussion, I am writing to let you know that I have opened a discussion on idwiki's village pump about the proposal of putting idwiki at the IABot Management Interface. Five people agreed to the proposal so far, and I believe it is enough for you to make it happen. Please let me know if you need anything else. Best, ··· 🌸 Rachmat04 · ☕ 07:09, 16 October 2019 (UTC)
You have been selected as an reserve election commissioner for 2019's ArbCom election
Greetings! Thank you for volunteering to serve as an election commission for WP:ACE2019. Following the community discussion at Wikipedia:Requests for comment/Arbitration Committee Elections December 2019/Electoral Commission, you have been selected as a reserve election commissioner for this year's election. Best of luck! — xaosflux Talk 00:07, 19 October 2019 (UTC)
Book reports again
Hi,
There is a conversation here about when reports are generated for any given book by Cyberbot1/NoomBot, especially the first one for a new book. Are you able to clarify? — Cheers, Steelpillow (Talk) 20:41, 24 October 2019 (UTC)
Inaccurate edit summary
I do not believe the error report was sent, so I am trying it this way
The following InternetArchiveBot edit to American Motors has the summary "Rescuing 2 sources and tagging 0 as dead.", yet the action performed was adding 2 archive sources and tagging 2 as dead, as seen below
Before <ref name=sharf>{{cite magazine|url=http://wardsautoworld.com/ar/auto_lee_iacocca_knew/index.html|author=Stephan Sharf|title=Lee Iacocca as I knew him; he was certainly the right man at the right time..|magazine=Ward's AutoWorld|date=May 1, 1996|accessdate=August 31, 2012}}</ref>
After <ref name=sharf>{{cite magazine|url=http://wardsautoworld.com/ar/auto_lee_iacocca_knew/index.html|author=Stephan Sharf|title=Lee Iacocca as I knew him; he was certainly the right man at the right time..|magazine=Ward's AutoWorld|date=May 1, 1996|accessdate=August 31, 2012|archive-url=https://web.archive.org/web/20110728081658/http://wardsautoworld.com/ar/auto_lee_iacocca_knew/index.html|archive-date=July 28, 2011|url-status=dead}}</ref>
Before <ref>{{cite magazine|url=http://wardsautoworld.com/ar/auto_daimlerchrysler_ifs/index.html|title=DaimlerChrysler: The 'What Ifs?'|magazine=Ward's AutoWorld||date=June 1, 1998|accessdate=August 31, 2012}}</ref>
After <ref>{{cite magazine|url=http://wardsautoworld.com/ar/auto_daimlerchrysler_ifs/index.html|title=DaimlerChrysler: The 'What Ifs?'|magazine=Ward's AutoWorld|4=|date=June 1, 1998|accessdate=August 31, 2012|archive-url=https://web.archive.org/web/20110728081757/http://wardsautoworld.com/ar/auto_daimlerchrysler_ifs/index.html|archive-date=July 28, 2011|url-status=dead}}</ref>
82.14.227.91 (talk) 15:37, 21 October 2019 (UTC)
- There is nothing inaccurate with the edit summary. The source was rescued and the dead flag set to make the archive link be dominant in the reference. There is nothing wrong here.—CYBERPOWER (Trick or Treat) 22:22, 24 October 2019 (UTC)
Book talk missing title and missing end tag for italics, small, bold errors
Cyberpower678: You deleted the following discussion without taking action. The last time, you ultimately did solve the problem, and I'm confident you will solve this one also. Please do not remove this discussion from your talk page without at least acknowledging. I don't want to go through a long cycle of dredging it up from the page history over and over, like last time. (I have edited my previous comment slightly, mainly inserting several missing right parentheses.) —Anomalocaris (talk) 05:14, 24 October 2019 (UTC)
You are messing up italics and titles in Book talk articles.
{{book report|Number 1's (Destiny's Child album)|''|GA|problems=|non-free=* [[:File:Destiny's Child – Number 1's.jpg]]
- causing a missing end tag for italics (
''
) and title not displaying where needed; should be (and I fixed it to) {{book report|Number 1's (Destiny's Child album)|''Number 1's''|GA|problems=|non-free=* [[:File:Destiny's Child – Number 1's.jpg]]
{{book report|Number 1's (Destiny's Child album)|''|GA|chapter=Album|problems=|non-free=* [[:File:Destiny's Child – Number 1's.jpg]]
- causing a missing end tag for italics (
''
) and title not displaying where needed; should be (and I fixed it to) {{book report|Number 1's (Destiny's Child album)|''Number 1's''|GA|chapter=Album|problems=|non-free=* [[:File:Destiny's Child – Number 1's.jpg]]
{{book report|''Stargate: The Ark of Truth|The Ark of Truth''|Unassessed|problems=* Page does not exist.
- causing a missing end tag for italics (
''
); should be (and I fixed it and you reverted and I fixed it again to) {{book report|Stargate: The Ark of Truth|''The Ark of Truth''|Unassessed|problems=* Page does not exist.
{{book report|''Stargate: Continuum|Continuum''|Unassessed|problems=|non-free=}}
- causing a missing end tag for italics (
''
); should be (and I fixed it and you reverted and I fixed it again to) {{book report|Stargate: Continuum|''Continuum''|Unassessed|problems=|non-free=}}
{{book report|''Stargate: Revolution|Revolution''|Unassessed|problems=|non-free=}}
- causing a missing end tag for italics (
''
); should be (and I fixed it and you reverted and I fixed it again to) {{book report|Stargate: Revolution|''Revolution''|Unassessed|problems=|non-free=}}
{{book report|Tak to chodí|''Tak to chodí'']] <small>by [[Michal Horáček (lyricist)|Start|chapter=Compilations|problems=|non-free=* [[:File:Tak to chodi front.jpg]]
- causing a missing end tag for
<small>
; should be (and I fixed it and you reverted) {{book report|Tak to chodí|''Tak to chodí'']] <small>by [[Michal Horáček (lyricist)</small>|Start|chapter=Compilations|problems=|non-free=* [[:File:Tak to chodi front.jpg]]
{{book report|''Wieland der Schmied (libretto)|Wieland der Schmied''|Unassessed|problems=* Page does not exist.
- causing a missing end tag for italics (
''
); should be (and I fixed it and you reverted) {{book report|Wieland der Schmied (libretto)|''Wieland der Schmied''|Unassessed|problems=* Page does not exist.
{{WBOOKS|class=book}}{{book report start|<big>'''Wiki How To</big>'''|The basics of Wikipedia}}
- causing misnested tags; should be (and I fixed it)
{{WBOOKS|class=book}}{{book report start|<big>'''Wiki How To'''</big>|The basics of Wikipedia}}
And similar errors that I fixed that you haven't reverted yet in:
- Book talk:Korn
- Book talk:Anthrax
- Book talk:Miami Hurricanes
- Book talk:Stephen King
- Book talk:Brandy Norwood
- Book talk:William Holden
- Book talk:Video game series
- Book talk:Nicktoons
- Book talk:Resident Evil series
- Book talk:Military conflicts of the Three Kingdoms era
- Book talk:Boeing Passenger Jets
- Book talk:Arkansas Confederate Infantry Units
- Book talk:Christianity
- Book talk:Diabetes mellitus type 1
And similar errors that I haven't fixed at Lint errors: Missing end tag in the Book talk namespace. Please deal with these errors, or at least point how to generate the list of "source code" pages and how to edit them to make the problem go away. —Anomalocaris (talk) 23:32, 15 October 2019 (UTC)
- Anomalocaris, I'm not deleting discussions, they just get archived by a bot. I took a quick look at this issue, but nothing sticks out this time, so I will need to find a moment to investigate this.—CYBERPOWER (Trick or Treat) 22:47, 24 October 2019 (UTC)
Tewiki iabot deployment followup
Hello Cyberpower678,
As only minor verification step is pending for iabot deployment on Telugu wikpedia, may i request your quick response to the change. Thanks --Arjunaraoc (talk) 06:31, 25 October 2019 (UTC)
IABot on idwiki — Follow up
Hi Cyberpower678!
Following our last discussion, I am writing to let you know that I have opened a discussion on idwiki's village pump about the proposal of putting idwiki at the IABot Management Interface. Five people agreed to the proposal so far, and I believe it is enough for you to make it happen. Please let me know if you need anything else. Best, ··· 🌸 Rachmat04 · ☕ 04:04, 22 October 2019 (UTC)
- Rachmat04, Alright. I've got a lot of work for IABot on my plate right now. I might not file an approval request immediately.—CYBERPOWER (Trick or Treat) 22:24, 24 October 2019 (UTC)
- That's fine, thank you. Please inform me if you need anything else on our side. ··· 🌸 Rachmat04 · ☕ 10:35, 25 October 2019 (UTC)
InternetArchiveBot for kawiki
Hello Cyberpower678! in Georgian Wikipedia, we decided that it would be useful if your bot "InternetArchiveBot" is also active in our Wikipedia. Let's start its adaptation for the Georgian Wikipedia, write me what is needed for this and begin. --Mehman 97 13:10, 23 October 2019 (UTC)
- Mehman97, Right now, patience. I'm a bit backlogged with IABot right now.—CYBERPOWER (Trick or Treat) 22:25, 24 October 2019 (UTC)
- OK. --Mehman 97 10:41, 25 October 2019 (UTC)
- I created task in Phabricator. --Mehman 97 17:38, 25 October 2019 (UTC)
- OK. --Mehman 97 10:41, 25 October 2019 (UTC)
Book reports again (reposted)
Hi,
I reposted this because we did not receive an answer and can find no information on the bot help page or elsewhere. Please could you give us any pointers to what you do or do not know, and are or are not able to help with, here?
There is a conversation here about when reports are generated for any given book by Cyberbot1/NoomBot, especially the first one for a new book. Are you able to clarify? — Cheers, Steelpillow (Talk) 08:54, 28 October 2019 (UTC)
IABot returns webcite links with broken text
Many links containing text written in Cyrillic/Japan writing using not UTF-8 encoding was archived to webcite with unrecoverably broken text, see for example https://www.webcitation.org/6E6ZP6HX9. All Cyrillic/Japan symbols was replaced by the Unicode symbol �, what rendered in certain pages as О©╫О©╫О©╫... or пїЅпїЅпїЅ... or 鐃緒申鐃緒申鐃緒申... etc. I have written a bot that removes such links from articles, but your bot returns them: [4] => [5]. I think, your bot should not reinsert such broken links into articles. I can explain, how I detect such links in my bot. MBH (talk) 18:38, 23 October 2019 (UTC)
- MBH, you can actually interface your bot with IABot. If you're interested in doing such a thing let me know.—CYBERPOWER (Trick or Treat) 22:27, 24 October 2019 (UTC)
- I'm interested. MBH (talk) 00:24, 25 October 2019 (UTC)
- MBH, m:InternetArchiveBot/API is basically documentation that allows a bot to interface with IABot. Which wiki is your bot running on?—CYBERPOWER (Trick or Treat) 02:09, 25 October 2019 (UTC)
- ruwiki. MBH (talk) 14:05, 25 October 2019 (UTC)
- MBH, m:InternetArchiveBot/API is basically documentation that allows a bot to interface with IABot. Which wiki is your bot running on?—CYBERPOWER (Trick or Treat) 02:09, 25 October 2019 (UTC)
- I'm interested. MBH (talk) 00:24, 25 October 2019 (UTC)
- I would be interested how you detect the bad WebCite links. -- GreenC 02:53, 25 October 2019 (UTC)
- I'm getting a list of all pages with links to webcite using API method list=exturlusage. For every page I get a list of all webcite links on page using API method prop=extlinks. For every link I download this link (downloaded page is almost empty, because all its content is in the frame on page), then I download https://www.webcitation.org/mainframe.php , which contains content of this link. I read this page as a byte stream and convert this stream into UTF-8. Then I remove all except <body> content, remove HTML tags, HTML character entity references, scripts and whitespaces, and calculate ratio between � symbol and any other symbols in the resulting text; if quantity of normal symbols is more than quantity of � symbol less than 5 times, it's certainly a broken page. I don't do this check if page isn't a HTML document (webcite contains many saved PDF documents) or if HTML code contains a string "charset=windows-1251" (because some such links, but not all, was correctly saved with windows-1251 encoding). I'm making a list of links on page that was identified as broken and removes it using regex (also removes "empty" links, when link is just https?://www.webcitation.org/? ). I can publish a C# code. MBH (talk) 14:05, 25 October 2019 (UTC)
- MBH , thank you for this information. Can I make you an offer/suggestion, that we collaborate on cleaning up the IABot database? I have tools that interface with the IABot API. I can provide you a list of all webcite URLs in the IABot database - this will be a complete list of all WebCite URLs in use across all Wikipedia languages (ru, en etc). You run the C# bot on this list, and report which are broken. Then I replace those with alternatives at other providers, like archive.org -- FYI I work with Cyberpower678 and the Internet Archive on various projects. I have known about the � problem for a while but wasn't sure how to detect them, which you have solved. -- GreenC 14:24, 25 October 2019 (UTC)
- The fact is that I'm still not completely sure that my algorithm works correctly in all cases. Rarely bot deletes non-broken links, I find a flaw in algorithm and correct it. For example, I still not grasp, how to distinguish broken and non-broken pages with win-1251 encoding. I want to polish algorithm (it may take some days or even weeks) and then collaborate with you. MBH (talk) 15:09, 25 October 2019 (UTC)
- MBH, excellent! Look forward to it. Also could you look at this page (I generate) and help understand why row #18 column F the number is so high compared to other languages? Similar for row #22. It's an unusual data spike I never understood the high usage of webcite, was it done by bot or for a technical or policy reason? -- GreenC 16:43, 25 October 2019 (UTC)
- It's result of mass bot-archiving by ru:user:WebCite_Archiver in 2012-2013, see special:CentralAuth/WebCite_Archiver. MBH (talk) 17:00, 25 October 2019 (UTC)
- Interesting thanks for this information. -- GreenC 20:04, 26 October 2019 (UTC)
- It's result of mass bot-archiving by ru:user:WebCite_Archiver in 2012-2013, see special:CentralAuth/WebCite_Archiver. MBH (talk) 17:00, 25 October 2019 (UTC)
- Another possibility, if you don't want to check all those links due to bandwidth or time etc.. is make a version of the program that takes a webcite URL on the command-line and returns "1" or "0" if it is a bad link or not. Then I can call it from another program. But I don't have a C# compiler and my tools are all unix based so that may be a problem. -- GreenC 14:59, 25 October 2019 (UTC)
- I run my bot on Toolforge using native mono, it's not a problem. MBH (talk) 15:09, 25 October 2019 (UTC)
- MBH , thank you for this information. Can I make you an offer/suggestion, that we collaborate on cleaning up the IABot database? I have tools that interface with the IABot API. I can provide you a list of all webcite URLs in the IABot database - this will be a complete list of all WebCite URLs in use across all Wikipedia languages (ru, en etc). You run the C# bot on this list, and report which are broken. Then I replace those with alternatives at other providers, like archive.org -- FYI I work with Cyberpower678 and the Internet Archive on various projects. I have known about the � problem for a while but wasn't sure how to detect them, which you have solved. -- GreenC 14:24, 25 October 2019 (UTC)
- I'm getting a list of all pages with links to webcite using API method list=exturlusage. For every page I get a list of all webcite links on page using API method prop=extlinks. For every link I download this link (downloaded page is almost empty, because all its content is in the frame on page), then I download https://www.webcitation.org/mainframe.php , which contains content of this link. I read this page as a byte stream and convert this stream into UTF-8. Then I remove all except <body> content, remove HTML tags, HTML character entity references, scripts and whitespaces, and calculate ratio between � symbol and any other symbols in the resulting text; if quantity of normal symbols is more than quantity of � symbol less than 5 times, it's certainly a broken page. I don't do this check if page isn't a HTML document (webcite contains many saved PDF documents) or if HTML code contains a string "charset=windows-1251" (because some such links, but not all, was correctly saved with windows-1251 encoding). I'm making a list of links on page that was identified as broken and removes it using regex (also removes "empty" links, when link is just https?://www.webcitation.org/? ). I can publish a C# code. MBH (talk) 14:05, 25 October 2019 (UTC)
- I can't distinguish broken and non-broken links to sites using win-1251 (russian) encoding. Here [6] first 2 links isn't broken, next 3 links is broken. For bot, all of it is broken (so my bot consider link is unbroken if it contains "charset=windows-1251" declaration), and all of it contains "charset=windows-1251". Have you any ideas about how to distinguish that cases? MBH (talk) 14:47, 26 October 2019 (UTC)
- First thought is retrieve the frame content and check for repeating
пїЅпїЅпї
strings. -- GreenC 20:04, 26 October 2019 (UTC)- For bot, there are isn't any �, there are ����� in all 5 links. MBH (talk) 01:59, 27 October 2019 (UTC)
- First thought is retrieve the frame content and check for repeating
- User:GreenC, can you provide me a list of random webcite links from IABot database, 10k-50k elements? I want to check all of it using my algorithm, create a list of "broken" links and we can check it manually. If there are errors, I will fix the algorithm, and if there are very few errors, algorithm can be used for IABot. Here is an example of my bot's edit, all of this links are really broken. MBH (talk) 15:01, 26 October 2019 (UTC)
- User:MBH:
/data/project/botwikiawk/wikicite.txt
a sample shuffled from an old full list with 'sort -R'. Please copy and I will delete. Will have the new full list by tomorrow. -- GreenC 20:04, 26 October 2019 (UTC)- Copied. MBH (talk) 01:59, 27 October 2019 (UTC)
- User:MBH:
User:GreenC, I launched my algorithm for testing your first list, that contains 40k webcite links. It processes ~1k links per hour (it does 2 web requests to laggy website for every link), so checking all 545k links from your full base can take ~23 days. Now it processed 2/3 links from list and links considered as broken are written to /data/project/mbh/webcite_result.txt. "utf" and "ru" is coeffitients, means probability of "link is broken", they calculated as "unbroken chars/broken chars integer ratio on page". Please, check this list, and if that links are really broken, you can use my algorithm. MBH (talk) 17:32, 29 October 2019 (UTC)
- MBH, I looked and presume results in webcite_result are bad links. I will wait ~23 days for the process to finish, when ready let me know. Not too concerned about detection errors because in most cases it will replace with a working archive at different provider so no harm done other than sporadic content drift. -- GreenC 19:17, 29 October 2019 (UTC)
Bug on WP:fr
Hi! I don’t know if your bot will be running again on WP:fr but be aware of a bug I just discovered. Perhaps you already fixed it!
There are currently 404 pages with |=}}
because of your bot. Can be found by searching insource:/\| *= *"}}"/
.
Regards FDo64 (talk) 19:41, 30 October 2019 (UTC)
IA Bot question
To keep with the trend, HI! I noticed IA Bot did not seem to archive this dead link that returned a 404 on Indiana bat (I have manually archived the link). I was wondering if this was on purpose, and/or if it is simply hard to determine if the page is indeed dead programmatically, even if it returns a 404 response. (I don't actually know how that bot works yet :P)
I am making the assumption that said bot is run at least yearly, but my assumption could be wrong.
Either way, I also wanted to mention that you have inspired me to be a better wikipeidean. I may come to you when I get started on making my own (approved) bot. :) Nerketur (talk) 15:48, 1 November 2019 (UTC)
IABot not adding archive links to some links
Hi Cyberpower,
I wanted to add archive links to all 300 xlinks at List of members of the 19th Bundestag. Lazy as I am, I thought I'd just let IABot do that for me (with the "Add archives to all non-dead references" option), and it did most of them. Some links were missed, though (for example ref no. 222, 261, 262). I first thought that's maybe because archive.org doesn't have the links, so I told archive.org to save these pages. I ran the bot again, and it still doesn't add archive links to these pages :( Is there something I'm doing wrong (other than being lazy)? Thank you for any help, —Kusma (t·c) 16:07, 1 November 2019 (UTC)
Administrators' newsletter – November 2019
News and updates for administrators from the past month (October 2019).
Interface administrator changes
|
|
- An RfC was closed with the consensus that the resysop criteria should be made stricter.
- The follow-up RfC to develop that change is now open at Wikipedia:Requests for comment/2019 Resysop Criteria (2).
- A related RfC is seeking the community's sentiment for a binding desysop procedure.
- Eligible editors may now nominate themselves as candidates for the 2019 Arbitration Committee Elections. The self-nomination period will close November 12, with voting running from November 19 through December 2.
Problem with one of IA Bot messages at SqWiki
Hello! When a problem has occurred during the archiving process, the bot posts the following message at the talk page:
Hello. During the archive process, the archive returned errors for one or more sites that I submitted for archiving.\nBelow, I have included the links that returned with an error and the following error message.\n\n{problematiclinks}\nIn any event this will be the only notification in regards to these links, and no further attempt will be made to archive the links.\n\nCheers.
It says it will include the links that had a problem and the error message they had but I suspect it never does. See here. In Albanian it reads "...me gabimin" (the following error message) and then the line is empty. I think that thing happens every time an archiving problem occurs. Is the code missing something? Or have I translated it wrong? - Klein Muçi (talk) 19:45, 31 October 2019 (UTC)
- I've been monitoring the IA contributions these days regarding this behavior and, like I suspected, it's always the same. Never does any kind of error message show up after the link. That part is definitely missing something or I have translated it badly. If it is the later case, please be kind enough to explain to me more thoroughly what that message actually means. The way I've understood it is that IA Bot tried to archive some links but for a specific reason couldn't do it and therefore notified us with a message including the links and the reason why it couldn't archive them (the error message). - Klein Muçi (talk) 01:11, 4 November 2019 (UTC)
Problem with one of IA Bot messages at SqWiki (de-archived)
Hello! When a problem has occurred during the archiving process, the bot posts the following message at the talk page:
Hello. During the archive process, the archive returned errors for one or more sites that I submitted for archiving.\nBelow, I have included the links that returned with an error and the following error message.\n\n{problematiclinks}\nIn any event this will be the only notification in regards to these links, and no further attempt will be made to archive the links.\n\nCheers.
It says it will include the links that had a problem and the error message they had but I suspect it never does. See here. In Albanian it reads "...me gabimin" (the following error message) and then the line is empty. I think that thing happens every time an archiving problem occurs. Is the code missing something? Or have I translated it wrong? - Klein Muçi (talk) 19:45, 31 October 2019 (UTC)
- I've been monitoring the IA contributions these days regarding this behavior and, like I suspected, it's always the same. Never does any kind of error message show up after the link. That part is definitely missing something or I have translated it badly. If it is the later case, please be kind enough to explain to me more thoroughly what that message actually means. The way I've understood it is that IA Bot tried to archive some links but for a specific reason couldn't do it and therefore notified us with a message including the links and the reason why it couldn't archive them (the error message). - Klein Muçi (talk) 01:11, 4 November 2019 (UTC)
- @Cyberpower678: please, take a look at this because an archiving bot won't let me post it there. :P - Klein Muçi (talk) 13:10, 8 November 2019 (UTC)
- @Cyberpower678: if you have time, I repeat my request. I'll try to write again at your discussion page if I don't get any answer here in a few days. - Klein Muçi (talk) 11:14, 20 November 2019 (UTC)
InternetArchiveBot page links
It's great to see page link being added by InternetArchiveBot, as discussed recently in the press. however, I'm concerned that, in edits like this one two links, and not one, to the same page are being created.
The relevant markup (some parameters omitted or clarity), for example, is:
{{cite book|title=Martin Luther King Jr.: A Dream of Hope|page=[https://archive.org/details/martinlutherking0000flem/page/9 9] |url=https://archive.org/details/martinlutherking0000flem/page/9}}
whereas just:
{{cite book|title=Martin Luther King Jr.: A Dream of Hope|page=9 |url=https://archive.org/details/martinlutherking0000flem/page/9}}
or alternatively:
{{cite book|title=Martin Luther King Jr.: A Dream of Hope|page=[https://archive.org/details/martinlutherking0000flem/page/9 9] }}
will suffice. Or perhaps:
{{cite book|title=Martin Luther King Jr.: A Dream of Hope|page=[https://archive.org/details/martinlutherking0000flem/page/9 9] |url=https://archive.org/details/martinlutherking0000flem/ }}
would make more sense, since |url=
is used to link the title of the book, not the page number.
Perhaps you might canvas opinion, and determine consensus, at suitable forum, such as Wikipedia talk:Citing sources#Internet Archive page links? Andy Mabbett (Pigsonthewing); Talk to Andy; Andy's edits 13:03, 4 November 2019 (UTC)
A barnstar for you!
The Anti-Vandalism Barnstar | |
Молодец! Ты настоящий герой Википедии! Удачи в будущем! Прости, что на русском. Kazrok4545 (talk) 07:48, 5 November 2019 (UTC) |
I really hope this was the result of a rogue autocorrect. —Cryptic 02:19, 5 November 2019 (UTC)
- Don't you know of the new Wikipedia standards. Consensus can only form through constipation. XD. Seriously, though, yea bad autocorrect.—CYBERPOWER (Around) 02:22, 5 November 2019 (UTC)
- Dear god man, what would be a sufficient amount of constipation to accept the proposal! :D Nosebagbear (talk) 15:09, 5 November 2019 (UTC)
Cyberbot I can't handle asterisk in user name
Maybe you already know this and it's not worth fixing but I didn't find anything in a quick search of your archives. I think asterisks in user names breaks User:Cyberbot I when generating User:Cyberbot I/Requests for unblock report. Some background, I was trying to work out why an editor (User:Tatzref) did not show up in the table when I noticed something suspicious, the table ended at P. Checking Category:Requests for unblock I noticed User talk:Pynf6PLq*8bl which looks the sort of thing that could easily break something. Alternatively maybe it's User:Prayingmantis211 who should be next alphabetically, but I have no idea why their request would break the bot. (Having a quick look at the history and I think they also showed up before.) Nil Einne (talk) 11:52, 8 November 2019 (UTC)
- Nil Einne, That might not be it, it isn't ending on the P's at the moment. Maybe there's a hard limit to the # of rows (that would make a lot of sense, too). Looking at the source, I think it might be limited to 50 rows. SQLQuery me! 00:50, 12 November 2019 (UTC)
Scholarship
I want to join this school.please how will I get started..am from Nigeria..I need a reply — Preceding unsigned comment added by Damzkid619 (talk • contribs) 22:30, 12 November 2019 (UTC)
- (talk page stalker) Please see the message I left on your own user talk page. Home Lander (talk) 22:39, 12 November 2019 (UTC)
AFD template removal
Hi, Apologies, Birgit Maass AFD template totally removed in error. Not sure how. Battleofalma (talk) 15:19, 15 November 2019 (UTC)
Minor issue in ukwiki (2)
Hi, I'm here again, same issue as before: bot continues to use "листопадааа" instead of "листопада" for November. Recent instances: [7], [8], [9], [10]. Thanks for your time. -- Ата (talk) 20:14, 5 November 2019 (UTC)
- Ата, yea, sorry, I haven't had much opportunity to look into this. But it's on my todo list.—CYBERPOWER (Chat) 13:45, 16 November 2019 (UTC)
Tewiki iabot deployment followup - 2nd reminder
I contacted on 25 Oct through your talk page. Here is another gentle reminder.
Hello Cyberpower678,
As only minor verification step is pending for iabot deployment on Telugu wikpedia, may i request your quick response to the change. Thanks --Arjunaraoc (talk) 15:57, 7 November 2019 (UTC)
- Arjunaraoc, I'll take a look at it later today or tomorrow. Sorry for the delay.—CYBERPOWER (Chat) 13:56, 16 November 2019 (UTC)
Inaccurate edit summary
... The following InternetArchiveBot edit to American Motors has the summary "Rescuing 2 sources and tagging 0 as dead.", yet the action performed was adding 2 archive sources and tagging 2 as dead...
82.14.227.91 (talk) 15:37, 21 October 2019 (UTC)
- There is nothing inaccurate with the edit summary. The source was rescued and the dead flag set to make the archive link be dominant in the reference. There is nothing wrong here.—CYBERPOWER (Trick or Treat) 22:22, 24 October 2019 (UTC)
From the response, I can only apologise for hurting your feelings with my use of the term "Inaccurate". 82.14.227.91 (talk) 23:01, 7 November 2019 (UTC)
- I can't tell if you're being sarcastic or not, but my feelings were not hurt.—CYBERPOWER (Chat) 13:56, 16 November 2019 (UTC)
Tp the history of the Skyscraper
Regarding the history of the skyscraper. There needs to be an addition of Leland Tower (aka Leland Hotel as it was originally). It was the first skyscraper outside of Chicago in 1928.It was built before the Chrysler Building and the Empire State Building. It also encompasses the idea of the skeleton construction as created by William Le Baron Jenney. In 1926 it was presented to Chicago architects Graven and Mayger, who had worked with Rapp and Rapp with the design of the Chicago Theater. Graven and Mayger were known for their theater designs but Leland was their first tall building as a partnership. Leland was completed in February 1928. Leland had details that you would have seen in the theaters of that time period. It was a hotel with a ballroom on the topmost floor called "The Sky Club", it is now apartments owned by David Karademas. — Preceding unsigned comment added by Tracy Duran (talk • contribs) 18:12, 10 November 2019 (UTC)
- Tracy Duran I'm not exactly sure what you're talking about. I don't work with Skyscraper articles.—CYBERPOWER (Chat) 13:59, 16 November 2019 (UTC)
I have a crush...
I have a crush on User:InternetArchiveBot. My favorite bot ever. Just thought I say that and thanks! --- Coffeeandcrumbs 12:52, 13 November 2019 (UTC)
- Coffeeandcrumbs, aww thanks. :-)—CYBERPOWER (Chat) 14:02, 16 November 2019 (UTC)
William Radde edit - Thank You :)
Thank you for this edit to William Radde https://en.wikipedia.org/w/index.php?title=William_Radde&diff=925567721&oldid=909031560. Outlier59 (talk) 03:34, 14 November 2019 (UTC)
- You're welcome. Be sure to also thank GreenC as his bot made that particular edit. :-)—CYBERPOWER (Chat) 14:04, 16 November 2019 (UTC)
Uncoding
Hi, why? 83.219.136.11 (talk) 08:40, 16 November 2019 (UTC)
- It's a configuration option within IABot. It's currently set to normalize the URL to a properly encoded state. This can be disabled by any admin at https://tools.wmflabs.org/iabot/index.php?page=wikiconfig&wiki=ruwiki. —CYBERPOWER (Chat) 14:07, 16 November 2019 (UTC)
RfA precentages
Hi Cyberpower678. I would like to get your thoughts on the idea of adding a percentage calculation (and possibly color coded) to User:Cyberpower678/Tally or Template:RfX tally. This would provide a snapshot of each candidate's current performance with respect to the pass/fail/crat chat ranges, similar to User:Cyberpower678/RfX Report. - MrX 🖋 12:36, 10 November 2019 (UTC)
- MrX, sure, why not?—CYBERPOWER (Chat) 13:57, 16 November 2019 (UTC)
- Thanks for your response Cyberpower678. I look forward to seeing the change on a future RfA page. Please let me know if I can help in any way (I'm not a template editor). - MrX 🖋 15:18, 16 November 2019 (UTC)
RfA precentages
Hi Cyberpower678. I would like to get your thoughts on the idea of adding a percentage calculation (and possibly color coded) to User:Cyberpower678/Tally or Template:RfX tally. This would provide a snapshot of each candidate's current performance with respect to the pass/fail/crat chat ranges, similar to User:Cyberpower678/RfX Report. - MrX 🖋 12:36, 10 November 2019 (UTC)
- @Cyberpower678: Are you able to respond to this before it's archived again? Thanks. - MrX 🖋 20:03, 14 November 2019 (UTC)
- MrX, sorry about that. For some reason, I'm not being alerted to talk page messages. :/. I've unarchived your thread and commented further up.—CYBERPOWER (Chat) 14:02, 16 November 2019 (UTC)
- No worries. I just assumed that you were busy with other things. - MrX 🖋 15:19, 16 November 2019 (UTC)
- MrX, sorry about that. For some reason, I'm not being alerted to talk page messages. :/. I've unarchived your thread and commented further up.—CYBERPOWER (Chat) 14:02, 16 November 2019 (UTC)
YGM - Urgent
YGM, please see as soon as possible. Please use email this user or ping me when replying. — xaosflux Talk 14:11, 17 November 2019 (UTC)
- Xaosflux, responded.—CYBERPOWER (Chat) 14:14, 17 November 2019 (UTC)
- ty, see reply. — xaosflux Talk 14:19, 17 November 2019 (UTC)
Duplicate parameter
not good since |url=
and |url-access=
were already in the citation with the exact same information. Frietjes (talk) 17:03, 17 November 2019 (UTC)
{{page needed}}
normally is not expected inside a cite template but I added a check. -- GreenC 19:38, 17 November 2019 (UTC)
Bot marks some available links as unavailable
Hi, your bot marks links to Scopus as unavailable, because at first visit (or sometimes) it asks authentication on freely available page, but after reload it is available. For example here. We have lots of pages with such links, can you please put links on Scopus in exceptions or something?--Igor Balashov (talk) 12:44, 6 November 2019 (UTC)
- Igor Balashov, hi to make things easier, you can use this tool to report false positives.—CYBERPOWER (Chat) 13:48, 16 November 2019 (UTC)
- Cyberpower678, I've used it to report few tens of such links yesterday, but it keeps marking even exact links that I already reported in other articles. Simply "https://www.scopus.com" should be in exceptions to never mark any links from it.--Igor Balashov (talk) 22:14, 17 November 2019 (UTC)
- Igor Balashov, I've whitelisted the domain.—CYBERPOWER (Chat) 22:32, 17 November 2019 (UTC)
- Cyberpower678, I've used it to report few tens of such links yesterday, but it keeps marking even exact links that I already reported in other articles. Simply "https://www.scopus.com" should be in exceptions to never mark any links from it.--Igor Balashov (talk) 22:14, 17 November 2019 (UTC)
User:Cyberbot I on Category:Requests for unblock, Summary section
Cyberbot I is now reporting "49 years ago" in the column "Block Expiration" on all but a few of the blocks. — Maile (talk) 11:41, 18 November 2019 (UTC)
- Maile66, I haven't gotten around to checking yet.—CYBERPOWER (Chat) 17:27, 18 November 2019 (UTC)