Long time no love for seobayers so iam writing this tutorial for you guys:

One of the biggest mysteries about xrumer has been creating profiles and me myself being a noob , but a smart one:P, always try to find ways for creating thins easier. Recently i was doing a test and found a good way of generating profile links easily. Before we start i will post results:

Tools- Hrefer 3.3, Xrumer 7.5, Scrapebox, Adword tool for keyword, Keyword studio pro and a linklist merger

Linklist generated- 250k (unique domains)
Profiles generated- 60k (unique domains) - a lot in just 20 days.

Step 1
Generate keywords which people actually try to rank for. This is the most crucial step in profile building. I saw a huge list being shared on another forum which had over 300k words. I say what is the use for searching keywords like "aaaaa", "abra ka dabra" etc. Who will rank for these? So i used common sense and generated keywords which people actually use for ranking.

Usage -
1) Note down atleast 100 keywords from google search (not adwords) which are broad and have massive competition like credit card, insurance etc.

2) One-by-one load these keywords into keyword studio pro (i used crack version:P) and generate similar keywords. Note that these words are better than scrapebox wonderwheel scraper.

3) Load all these keywords into scrapebox and generate keywords upto 3 levels. De-dup them etc. and save them. Then in the end make a huge list of all the keywords generated from original 100 words.

Step 2
Load them into hrefer but before you begin do this:

1) Edit additive words and just use these:

"powered by smf"
"powered by vbulletin"
"powered by phpbb"
"powered by ip.board"

Etc. etc. etc. but only forum software names. I did this so i don"t know which are most useful.

2) Edit search engine filter and place this:

Member.php?action=
profile.php?viewprofile=

Etc. etc. basically all those filters which have view profile in their url.

The logic is to get maximum of those profiles which have been made for the purpose of visible backlinks. (Iam having trouble loggin into my dedi so will post the complete list later)

3) Start scraping and wait for it to complete.

Step 3- Another very important step.

Usually you all just load the links in xrumer and blast away but no you should analyze the database also to remove dead ones. So click on tools -> database analysis

1) Type the same "powered by" stuff into the box on left hand side.

2) Choose your linklist and select search content and 50 threads and go. This will remove all 404 forums and blogs etc.

Now we run xrumer with this linklist.

1) Load the database and setup your project.

2) Goto advanced options -> select register only, goto speed<--->success and move all the way to success and select recaptcha with 40-50 atempts.

3) Start and let it complete.

While the posting is done i usually move to the next keyword building step.

When all your profiles are created, load them into linklist merger and create a big file from it. Now remove dups and all and you have a good profile link package.

So the jist is - keywords which people search - only few additive words and filters - right settings etc.

You will notice that from your final linklist with medium success and recpatcha off and 20-25 atempts for captcha you will get approx 30k+ profiles. I get 60k with recaptcha and half-success

Happy profile creation....

Let's start right away with the main script code:

#!/usr/bin/perl

# which-forum.pl script
# (c) 2010 Alexandr A Alexeev, http://site/

use strict;

# commented lines - for rigor
# if the task is to collect engine statistics, leave it as is
# if you make a list of forums - uncomment

my $data ;
$data .= $_while (<> ) ;

# check how much was Powered by phpBB without a link in the footer
print "phpbb \n"
if ($data =~ /]+href="[^"]*http:\/\/(?:www\.)?phpbb\.com\/?"[^>]*>phpBB/i or
# $data =~ /viewforum\.php\?[^""]*f=\d+/i or
$data =~ /phpBB\-SEO/i or
$data =~ /) ;
print "ipb \n"
if ($data =~ /]+href="[^"]*http:\/\/(?:www\.)?invision(?:board|power)\.com\/?[^"]*"[^>]*> [^<]*IP\.Board/i or
$data =~ /]+href="[^"]*http:\/\/(?:www\.)?invisionboard\.com\/?"[^>]*>Invision Power Board/i or
$data =~ /

/i or
$data =~ /index\.php\?[^""]*showforum=\d+/i) ;
print "vbulletin \n"
if ($data =~ /Powered by:?[^<]+vBulletin[^<]+(?:Version)?/i or
$data =~ /) ;
print "smf \n"
if ($data =~ /]+href="[^"]*http:\/\/(?:www\.)?simplemachines\.org\/?"[^>]*>Powered by SMF/i or
$data =~ /index\.php\?[^""]*board=\d+\.0/i) ;
print "punbb \n"
if ($data =~ /]+href="[^"]*http:\/\/(?:(?:www\.)?punbb\.org|punbb\.informer\.com)\/?"[^>]*> PunBB/i) ;
#or
# $data =~ /viewforum\.php\?[^""]*id=\d+/i); \n"
print "fluxbb
# if($data =~ /viewtopic\.php\?id=\d+/i or /if ( $data =~) ;
]+href="http:\/\/(?:www\.)fluxbb\.org\/?"[^>]*>FluxBB/i \n"
if ($data =~ /print "exbb]+href="[^"]*http:\/\/(?:www\.)?exbb\.org\/?"[^>]*>ExBB/i
) ;
# or \n"
if ($data =~ /# $data =~ /forums\.php\?[^""]*forum=\d+/i); or
print "yabb
]+href="[^"]*http:\/\/(?:www\.)?yabbforum\.com\/?"[^>]*>YaBB/i \n"
if ($data =~ $data =~ /YaBB\.pl\?[^""]*num=\d+/i ) ;<\/title>/i or
$data =~ /print "dleforum<\/a>/i) ;
/\(Powered By DLE Forum\) \n"
if ($data =~ /]+href="[^"]+(?:http:\/\/(?:www\.)?dle\-files\.ru|act=copyright)[^"]*">DLE Forum or
$data =~ /]+href="[^"]*http:\/\/(?:www\.)?ikonboard\.com\/?[^"]*"[^>]*>Ikonboard/i \n"
if ($data =~ /]*>ExBB/i
$data =~ /ikonboard\.cgi/i ) ;
print "bbpress
# $data =~ /topic\.php\?id=\d+/i or \n"
# $data =~ /forum\.php\?id=\d+/i);
print "flashbb
if ($data =~ /# if($data =~ /forums\.php\?fid=\d+/i or) ;
# $data =~ /topic\.php\?fid=\d+/i or \n"
]+href="http:\/\/(?:www\.)?flashbb\.net\/?"[^>]*>FlashBB/i
if ($data =~ /print "stokesit) ;
# if($data =~ /forum\.php\?f=\d+/i or \n"
]+href="http:\/\/(?:www\.)?stokesit\.com\.au\/?"[^>]*>[^\/]*Stokes IT/i
if ($data =~ /print "podium) ;
# if($data =~ /topic\.php\?t=\d+/i or \n"
]+href=[""]?http:\/\/(?:www\.)?sopebox\.com\/?[""]?[^>]*>Podium/i
if ($data =~ /print "usebb) ;
# if($data =~ /forum\.php\?id=\d+/i or \n"
]+href="http:\/\/(?:www\.)?usebb\.net\/?"[^>]*>UseBB/i
if ($data =~ /print "wrforum) ;
# if($data =~ /index\.php\?fid=\d+/i or \n"
]+href="http:\/\/(?:www\.)?wr\-script\.ru\/?"[^>]*>WR\-Forum/i
$data =~ print "yetanotherforumnet) ;

You will find this and other scripts mentioned in the post in this archive.

Script which-forum.pl examines the HTML page code to see if it contains signatures of the forum engine. We used a similar technique when defining WordPress and Joomla, but there are a couple of differences. Firstly, the script itself does not load the page code, but reads it from stdin or a file passed as an argument. This allows you to download the page once, for example, using wget, and then run it through several analyzers, if we have more than one. Secondly, in this script the presence of a signature is 100% a sign of the engine. Last time, the presence of a signature only added weight to the corresponding engine and the engine with the greatest weight “won”. I decided that in this case, such an approach would only unnecessarily complicate the code.

To test the script, I did some research. I compiled a list of several thousand forums and ran each of them through my script, thereby determining the percentage of program responses and the popularity of various engines.

To get the list of forums, I used my Google parser. Queries like this were sent to the search engine

site:forum.*.ru
site:talk.*.ru
site:board.*.ru
site:smf.*.ru
site:phpbb.*.ru
....

and so on. You will find the complete query generator code in the file gen-forumsearch-urls.pl. In addition to zone.ru, .su .ua .kz and .by were also used. Last time, it was difficult to conduct such a study, since WordPress and Joomla sites do not have such signatures in the URL. Catalogs like cmsmagazine.ru/catalogue/ do not provide a sufficient sample size. What is 600 Drupal sites?

I must admit, the results of the experiment disappointed me. Of the 12,590 sites studied, the engine was successfully identified on only 7,083, that is, only in 56% of cases. Maybe I didn't take into account some engine? Was it really true that half of the forums had Bitrix installed? Or should I have spent more time searching for signatures? In general, additional research is required here.

Among the 56% of successfully identified engines, the most popular, as expected, were IPB (31%), phpBB (26.6%) and vBulletin (26.5%)

They are followed with a large lag by SMF (5.8%) and DLEForum (5.3%). My favorite punBB was only in 6th place (1.64%). I wouldn’t recommend putting much faith in these numbers (they say that every third forum on the RuNet runs on IPB), but certain conclusions can, of course, be drawn.

For example, if you intend to make a site on a forum engine and plan to modify the forum, say, pay users $0.01 for each message with automatic withdrawal of funds once a week, then you should choose one of the three most popular engines. The more popular the forum, the greater the chances of finding a programmer who is well versed in it.

If no significant changes are expected in the engine, then it may make sense to choose a less popular engine, for example SMF or punBB. This will reduce the number of hacker attacks on your forum and the amount of spam automatically sent on it.

Scripts for searching/identifying forums can also find many practical applications. The first thing that came to my mind was to sort the identified forums by TIC and post on the first hundred posts with links to one of my sites. However, hundreds of forum dofollow links did not affect the TCI in any way (2 updates have passed), so it is better not to waste time here, unless you are not interested in transitions.

It is clear that the mentioned use of scripts is far from the only one. I think you can easily figure out how else you can use them.

views