Notice: Undefined index: HTTP_ACCEPT_LANGUAGE in /var/www/clients/client1/web74/web/lib/common.php on line 17
FAQ about WDF*IDF & our service | Text Tools

FAQ general FAQ.

The analysis is actually not as innovative as the SEOS worldwide treat the topic. The origins of the analysis go back to the seventies, even though they naturally were not drawn for search engines or other rankings. However the formula and the calculation of terms on which the analysis is based dates back to the seventies. This information only mentioned incidentally. It is primarily all about understanding and clarifying the abbreviation, what is hidden behind the six letters:

Within Document Frequency – WDF

In words, this section describes nothing but the term weighting within a document. In turn the term in this case stands for the keyword which appears in a certain frequency in the text. This principle could represent the customary keyword density, however, WDF at the same time operates with a correction value which then allows for a uniform reference to result. In addition WDF is limited by the

Inverse Document Frequency – IDF

This area of the analysis does not only concentrate on one single document but refers the weighting of a certain term in the entire site. If, for instance in a blog only one single contribution on the site deals with photo albums the word “Garden chair” is important for the analysis – after all it is only mentioned in one single text on the site. However, if a term goes through the whole text, it is not interesting for the analysis as most probably not only one keyword is concerned.

If a text has an own DNA, the WDF*IDF analysis will reveal it. The latter of course only functions in a figurative sense, however the practical technique offers the possibility to reveal how well a certain word reflects the contents of a text. As already indicated in the example above, a word such as “and” will hardly point towards a relative content. The conjunction is found in every text and has absolutely no value to offer – other than to connect clauses and sentences. The word “garden chair” on the other hand already has a value and provides the opportunity to clearly localise search results.

This is a question which anyone who tackles the subject matter for the first time asks himself. The mathematics may interest mathematical wizards, however, it is not of interest for most of the site operators on their way to the desired ranking in the search engines. Put simply, the analysis reveals at a glance which terms are found in connection with the desired search term. Thus, based on the example “Garden chair of wood” not only
  • Garden chair
  • Wood
  • could be of importance, but at the same time the terms
  • furniture
  • outdoor
  • garden
  • patio
  • tables
  • outdoor furniture
  • chairs
  • table
  • dining
  • benches
  • The text created on the topic should therefore not only contain the main keyword “Garden chair of wood” but also the additional terms. Thereby know-how gained from the analysis could, of course, act as guideline for the developing text. It is certain that a good text on the topic will anyway contain the additional words – at least if the topic is viewed from its different aspects and in detail.
    It is a general problem that the analysis principally reveals numerous words and marks them with a clear keyword curve in the diagram. The tools provided usually operate with a maximum and an average word frequency, so that the text resulting should orientate itself as closely as possible to the average value. Yet what happens if the resulting text should have a mere 250 words, the analysis however already refers to a total of 100 words? In the latter case the text should reveal a clear focus and ignore a number of words displayed in the diagram. Should the text on the garden chairs of wood, for example, particularly emphasis that a manufacturer particularly backing sustainable timer production is concerned, it is recommended that the focus be put on the FSC seal and to associate this with tropical timber. The WDF*IDF analysis is, however, of particular benefit for long documents for which the entire range of words in connection with the search term can be used.
    Search terms or keyword densities have not completely vanished into thin air with the WDF*IDF analysis. Those wanting to deal with the new tools correctly and wanting to created their documents in line with the new know-how, should consider the analysis results as direction signs and guidelines. If matching terms and synonyms in connection with the topic also occur, the document will be found way up in the ranking of search engines. At least until the term weighting is changed again by Google & Co. and a new marvel comes into being.

    Right now, we take the first 32 result to form a corpus from which we derive the resulting chart.

    These are some lesser known facts about the charts:

    • Click-and-drag will open a rectangle. If you release the left mouse button, you will "zoom" into the chart. This makes it easier to read and analyze the terms on the x axis. Clicking "Reset zoom" will... reset the zoom to normal view.
    • Hovering over the entires on the legend will highlight this particular entry in the chart, the other 15 results are greyed out
    • Clicking on a URL in the legend will disable this entry in the chart. Click again to re-enable its display
    You are invited to share the chart with you clients or customers. Especially the "compare" tab (if available) might be interesting: if you are a writer, you can present your results without showing the actual text.

    As the search engine results change from time to time, it is not desireable to use old data. Therefore, we plan to retire results older than 6 or 9 month.

    We use highcharts.com's library.
    Internally: yes.

    For the general public: not yet.

    If you need customizations, you will need a custom installation and from there on, we are able to incorporate your APIs or custom notifications. Get in touch with us and we work simething out.

    Sure. Drop us an email and we will get in touch with you.
    Yes, we can arrange this. Please contact us for more information.
    Affiliates are welcome. We offer or own affiliate program, sign up here. We offer 30% commission for the first client payment and 15% recurring commission.

    We work on a partnership with other affiliate networks.

    We are using a Google API that delivers most results just as they are on Google. Alas, some results are mixed, most people agree, that Google does this to discourage SEOs from using their API.

    With those mixed in results, Google adds more diversity to our corpus (a.k.a sum of regarded documents) which leads to more diversified results. Some high-WDF*IDF-keywords might suffer (in terms of WDF*IDF value) from this, but you will get similar results.

    In some cases (new & emerging keywords and few relevant results), your result will be not optimal. Please wait 1 or 2 days and submit a new job for your keyword. Your results should improve.

    The German Wikipedia as an article on WDF*IDF
    DMCA.com Protection Status