Page Category Algorithm

Page Category Algorithm

PCA (Page Category Algorithm) is an in-house developed algorithm to identify the category of different pages.

During the years as an analyst I have acquired a set of models, procedures and calculations to make my life easier when working with website optimization.

Page Category Algorithm

Identification of categories; navigation, content, insignificant, indecisive, conversion
Whether it be SEO, CRO or Analysis it comes down to the purpose of the web page compared to the experienced intent of the web page. We work hard to make the pages aligned with both intent, purpose and expected delivery of the content of the page.

I have therefore come up with a calculation that uses a set of pattern recognition that relies on the web page statistics, site structure and content.

By combining everything from average time on page, bounce rate, exit rate, entrance rate, form presence, link profile, and content (plus more), we built these patterns into a machine learning model that can be refined every day.

As a user, you can even make this algorithm better, by setting the categories for the page if you should see a better fitting category. Yearbase will then recalculate and learn from the alterations.

Why Page Categories?

The entire website consists of different types of pages with different intents. Some pages drive users onwards, while others present content.

Some pages are only part of a set of pages that combined serves as a machine for driving users through a funnel, like the shopping cart page.

Some of the metrics recognized as, serves as a directional pattern to categorize the page into a specific category.

Navigational pages

The most used navigation page, is the front page of a website. It’s purpose is not to deliver content, but drive the users onward towards their destination.

Some of the patterns recognized are:

  • Lower average time on page
  • Lower bounce rate
  • Lower exit rate
  • Lower entrance rate

Content pages

Content pages are usually the pages you get ranking on, since they contain more text and serves to deliver content to the users. The patterns recognized on these pages are more or less pretty much the opposite of navigational pages.

  • Higher timer on page
  • Higher bounce rate
  • Higher exit rate
  • Higher entrance rate

Page Categories

The different page categories we have launched in the version 1 of Yearbase are:

  • Content Pages (pages with content)
  • Navigational Pages (pages to drive users onward)
  • Conversion Pages (pages that generate value)
  • Insignificant Pages (pages with too low volume)
  • Indecisive (unrecognizable pattern)

Enhanced Categorization

We are working on enhancing the Page Category Algorithm, on including further categories and maybe also find subgroups of the before mentioned categories.

We see that some pages are of greater value than others, serving a greater purpose either directly or indirectly. We are working on this enhancement for the time being.

I hope you find this feature useful, as it has served me for some years when optimizing web pages.