Keywords Spam Detector
- Other text mining tools:
- Levenshtein Distance
- Positional Posting Lists
- Text Streamer
- and many more »
- Term repetitions abuse is considered an adversarial information retrieval practice known as keywords spam.
- This tool helps you write better titles, abstracts, descriptions, paragraphs, or full text by allowing you to detect and fix over-repeated terms.
- Our tool uses a proprietary algorithm for detecting frequency-based spam from a piece of text of any length.
- Once detected, over-repeated terms can be edited by either reducing their term frequency or diluting the input by adding unique terms not present in the original text.
Recommendations for Writing Titles
- Search Engines
Search engines might process entire titles, but tend to display in their search results about less than 70 characters. So you may want to limit web page titles to about this mark, like between 60 to 65 characters. - Academic Journals
Some editorial guidelines, like JAMA, limit the length of titles to 150 characters for reports of research and other major articles and 100 characters for Editorials, Viewpoints, Commentaries, and Letters. (JAMA, 2016). - Words Usage in Titles
The average character length of a word in English, Spanish, and similar languages is about six. Thus on average a 60-character title amounts to about 10 words, regardless of if these are unique terms. This is just a reference mark as text estimates can be influenced by other variables. For instance, text averages can be topic-sensitive and influenced by their syntactic structure (Busch-Lauer, 2000).
Recommendations for Writing Descriptions
- Search Engines
If the text of a meta description tag is relevant to the content of a web page, a search engine might display it as the visible abstract in a search results page. A search engine might instead display relevant text fragments taken from said web page. In both cases, a search engine might process the entire length of a summary, but display about two lines in an 80-character/line format or about three lines in a 60-character/line format. This amounts to 160 and 180 characters. So to be on the safe side you may want to limit web page descriptions to between 150 to 160 characters. Using the 6-character/word average, this amounts to about 25 to 27 words. For instance, 26 words amounts to about 156 characters. This is fair enough for most search engines, but not for academic journals. - Academic Journals
Some editorial guidelines, like JAMA, limit the length of abstracts to 350 words for reports of original research and other major articles and 200 words for major manuscripts. (JAMA, 2016). Other journals might use different abstract guidelines. Using the 6-character/word average, the mentioned word limits amount to about 2,100 and 1,200 characters. - Words Usage in Abstracts
Abstracts (descriptions) should motivate others to read the full article. A well written abstract should describe the issue to be solved, its importance, and major findings. Results-reporting abstracts are more effective than methods-describing abstracts.
- The length of a title is a relative concept. By current standards, a 60-character title, which amounts to about 10 words, is considered fair enough for search engines, very short for most academic journals, but too long for songs. Indeed. A recent study found that song titles with one or a few words are on the rise and preferred (Kopf, 2016). However, these types of titles are not informative enough for search engines and academic journals.
- Generally speaking, articles with short titles are more attractive to readers than those with longer titles because the latter are frequently perceived as complex, confusing, or boring. If readers don't find attractive a title or cannot understand it, there is a little chance that they will read or cite its abstract or the full paper (Deng, 2015; Chawla, 2015).
- A 2015 study confirmed that academic papers with short titles receive more citations per paper, being more attractive to readers than articles with longer titles (Letchford, A., Moat, H. S., and Preis, T., 2015).
- A 2012 study found that short-titled articles have higher viewing and citation rates than those with longer titles. Similarly, articles with results-describing titles are cited more often than those with methods-describing titles (Paiva, Nogueira Lima, & Ribeiro Paiva, 2012). The same study found that titles containing a question mark, containing a reference to a specific geographical region, and that used a colon or a hyphen were associated with a lower number of citations.
- Data miners, writers, journal editors, SEOs, and web content creators.
- Submit a search in Google using highly competitive keywords. Copy/paste and analyze each of the top ten titles.
- In the previous exercise: Create new titles based on the ones reported by Google.
- Analyze titles and abstracts from two scientific journal articles known to use different editorial guidelines for their lengths. What does the results tell you about the usage of stopwords and word lengths?
- Busch-Lauer, I. (2000). Titles of English and German Research Papers in Medicine and Linguistics. In Analysing Professional Genres; pp 77. Edited by Anna Trosborg.
- Chawla, D. S. (2015). In brief, papers with shorter titles get more citations, study suggests. Science Magazine. August 25, 2015.
- Deng, B. (2015). Papers with shorter titles get more citations - Intriguing correlation mined from 140,000 papers. Nature. August, 26, 2015.
- JAMA (2016). JAMA Internal Medicine Instructions For Authors.
- Kopf, D. (2016). Say Hello to the Era of One-Word Song Title. Priceonomics.com
- Letchford, A., Moat, H. S., and Preis, T. (2015). The advantage of short paper title. DOI: 10.1098/rsos.150266.
- Paiva, C. E., Nogueira Lima, J. P. S., and Ribeiro Paiva, B. S. (2012). Articles with short titles describing the results are cited more often. Clinics (Sao Paulo). 2012 May; 67(5): 509-513. DOI: 10.6061/clinics/2012(05)17; PMCID: PMC3351256.