Wednesday 13 August 2014

SCIgen randomly generates nonsense in the form of computer science research papers that can get accepted and published by journals with low submission standards.

WikipediA
The Free Encyclopedia




SCIgen

SCIgen is a program created by scientists at the Massachusetts Institute of Technology that randomly generates nonsense in the form of computer science research papers, including graphs, diagrams, and citations. It uses a context-free grammar to form all elements of the papers, and its authors state that their aim is "to maximize amusement, rather than coherence."


Opening abstract of Rooter: A Methodology for the Typical Unification of Access Points and Redundancy:

Many physicists would agree that, had it not been for congestion control, the evaluation of web browsers might never have occurred. In fact, few hackers worldwide would disagree with the essential unification of voice-over-IP and public/private key pair. In order to solve this riddle, we confirm that SMPs can be made stochastic, cacheable, and interposable.



Prominent results

In 2005 a paper generated by SCIgen, Rooter: A Methodology for the Typical Unification of Access Points and Redundancy, was accepted as a non-reviewed paper to the 2005 World Multiconference on Systemics, Cybernetics and Informatics and the authors were invited to speak. The authors of SCIgen described their hoax on their website, and it soon received great publicity when picked up by Slashdot. WMSCI withdrew their invitation, but the SCIgen team went anyway, renting space in the hotel separately from the conference and delivering a series of randomly generated talks on their own "track." The organizer of all these conferences is Professor Nagib Callaos. The WMSCI was also sponsored by the Institute of Electrical and Electronics Engineers from 2000 until 2005. The IEEE stopped granting sponsorship to Callaos from 2006 to 2008.

Submitting the paper was a deliberate attempt to embarrass WMSCI, which the authors claim accepts low-quality papers and sends unsolicited requests for submissions in bulk to academics. As the SCIgen website states:

One useful purpose for such a program is to auto-generate submissions to conferences that you suspect might have very low submission standards. A prime example, which you may recognize from spam in your inbox, is SCI/IIIS and its dozens of co-located conferences (check out the very broad conference description on the WMSCI 2005 website).

Computing writer Stan Kelly-Bootle noted in ACM Queue that many sentences in the "Rooter" paper were individually plausible, which he regarded as posing a problem for automated detection of hoax articles. He suggested that even human readers might be taken in by the effective use of jargon ("The pun on root/router is par for MIT-graduate humor, and at least one occurrence of methodology is mandatory") and attribute the paper's apparent incoherence to their own limited knowledge. His conclusion was that "a reliable gibberish filter requires a careful holistic review by several peer domain experts".

List of works with notable acceptance

  • Rob Thomas: Rooter: A Methodology for the Typical Unification of Access Points and Redundancy, 2005 for WMSCI (see above)
  • Mathias Uslar's paper was accepted to the IPSI-BG conference.
  • Professor Genco Gulan published a paper in the 3rd International Symposium of Interactive Media Design.
  • Students at Iran's Sharif University of Technology published a paper in the Journal of Applied Mathematics and Computation (which is published by Elsevier). The students wrote under the false, non-Persian surname, MosallahNejad, which translates literally from Persian, in spite of being "non-Persian" as: "from an Armed Breed". The paper was subsequently removed when the publishers were informed that it was a joke paper.
  • A paper titled "Towards the Simulation of E-Commerce" by Herbert Schlangemann was accepted as a reviewed paper at the "International Conference on Computer Science and Software Engineering" (CSSE) and was briefly in the IEEE Xplore Database. The author is named after the Swedish short film Der Schlangemann. Furthermore the author was invited to be a session chair during the conference. Read the official Herbert Schlangemann Blog for details. The official review comment: "This paper presents cooperative technology and classical Communication. In conclusion, the result shows that though the much-touted amphibious algorithm for the refinement of randomized algorithms is impossible, the well-known client-server algorithm for the analysis of voice-over-IP by Kumar and Raman runs in _(n) time. The authors can clearly identify important features of visualization of DHTs and analyze them insightfully. It is recommended that the authors should develop ideas more cogently, organizes them more logically, and connects them with clear transitions"
  • Mikhail Gelfand published a translation of the "Rooter" article in the Russian-language Journal of Scientific Publications of Aspirants and Doctorants in August 2008. Gelfand was protesting against the journal, which was apparently not peer reviewed and was being used by Russian PhD candidates to publish in an "accredited" scientific journal, charging them 4000 Rubles to do so. The accreditation was revoked two weeks later.

Spoofing Google Scholar and h-index calculators

A 2010 paper by Cyril Labbé from Grenoble University demonstrated the vulnerability of h-index calculations based on Google Scholar output by feeding it a large set of SCIgen-generated documents that were citing each other (effectively an academic link farm). Using this method the author managed to rank "Ike Antkare" ahead of Albert Einstein for instance.

How many SCIgen papers can be found in IEEE Xplore?

A 2013 scientometrics paper demonstrated that at least 85 SCIgen papers have been published by IEEE. Springer and IEEE removed over 120 SCIgen papers according to this research.





No comments:

Post a Comment