Adding IDs to urls to open up namespace – SEO considerations?

I’m working with a site that has millions of pages along the lines of:

domain.com/entity/John_Smith domain.com/entity/Google domain.com/entity/HTML 

One of the problems is that particularly for people names, there are lots of duplicates, so we end up with

domain.com/entity/John_Smith domain.com/entity/John_Smith_(explorer) domain.com/entity/John_Smith_(politician) 

We instead want to move to a url scheme (much like StackExchange) like:

domain.com/entity/1234/John_Smith (where 1234 is a unique ID).

I assume that if we:

  • put in 301 redirects from every old page to new page
  • updated our sitemap with the new urls

then we’ll be pretty much covered. But could there be any short-term SEO implications that we should expect? (Dips in traffic, etc.?) Are there any pieces of this puzzle that we’re missing?

Remove duplicate urls from List Bug

when I select remove duplicate urls from list ( submitted, verified or any other) , ser skips some files and doesnt remove all duplicates. If any single file is selected to remove duplicates, then duplicates get removed successfully. I have attached the video too.
https://filebin.net/00u8dn0dz6b8qssk/20200101_030048.mp4?t=pkxlt641
@Sven Please check. Thanks

001 possible new Target URLs from present accounts

Hi @sven,

Ever since a recent update, when I submit posts to my network, I keep getting this:

001 possible new Target URLs from present accounts

I import say 4 URLs, and it re-submits to the last one twice… this seems to only happen when I run the project at over 1 thread.

I’ve tried:

Clear history
Clear cache
Remove unused accounts (there are none)

I also have a friend who uses my same network script and is now experiencing the same bug. It never happened before a few versions ago.

Any idea what could be causing this?

Note: “Allow posting to same site again” is UNCHECKED.

– – – – – – – – – – – – – – – – – – – – – – – – – – – – – – – –

Thinking about what could possibly be causing this bug, and being that it seems to only happen when I run at over 1 thread… I’m wondering if there’s an issue with “unused threads” being forced to submit somewhere. Take for example, if you import 4 URLs and use 2 threads.

Thread 1 -> URL1

Thread 2 -> URL2

Thread 1 -> URL3

Thread 1 -> URL4

Thread 2 -> This thread is now ‘unused’ and needs something to do, and perhaps this could be why it’s forcibly re-submitting to “present accounts” even though “Allow posting on same site again” is unchecked?

No idea if this could be causing it, but I can confirm that it only happens when running over 1 thread. With only 1 thread, the issue never happens.

Here is the log (domains changed for privacy):

22:49:13: [ ] Loaded 4 URLs from imported sites 22:49:22: [+] 1/4 matches engine WordPress-Selfhosted - https://www.domain1.com/wp-login.php?redirect_to=https%3A%2F%2Fwww.domain1.com%2Fwp-admin%2F&reauth=1 22:49:22: [+] 1/4 registration successful - https://www.domain1.com/wp-admin/ 22:49:22: [+] 1/4 new URL - https://www.domain1.com/wp-login.php 22:49:22: [+] 1/4 login successful - https://www.domain1.com/wp-admin/ 22:49:22: [+] 1/4 new URL - https://www.domain1.com/wp-admin/post-new.php 22:49:22: [+] 1/4 new URL - https://www.domain1.com/17/post-url-here.html 22:49:22: [+] 1/4 submission successful (1 submitted - AVG: 397.88/h) - https://www.domain1.com/17/post-url-here.html 22:49:22: [+] 2/4 matches engine WordPress-Selfhosted - https://www.domain2.com/wp-login.php?redirect_to=https%3A%2F%2Fwww.domain2.com%2Fwp-admin%2F&reauth=1 22:49:22: [+] 2/4 registration successful - https://www.domain2.com/wp-admin/ 22:49:22: [+] 2/4 new URL - https://www.domain2.com/wp-login.php 22:49:22: [+] 2/4 login successful - https://www.domain2.com/wp-admin/ 22:49:22: [+] 2/4 new URL - https://www.domain2.com/wp-admin/post-new.php 22:49:22: [+] 2/4 new URL - https://www.domain2.com/post-url-here/ 22:49:22: [+] 2/4 submission successful (2 submitted - AVG: 791.64/h) - https://www.domain2.com/post-url-here/ 22:49:29: [+] 3/4 matches engine WordPress-Selfhosted - http://www.domain3.com/wp-login.php?redirect_to=http%3A%2F%2Fwww.domain3.com%2Fwp-admin%2F&reauth=1 22:49:29: [+] 3/4 registration successful - http://www.domain3.com/wp-admin/ 22:49:29: [+] 3/4 new URL - http://www.domain3.com/wp-login.php 22:49:29: [+] 3/4 login successful - http://www.domain3.com/wp-admin/ 22:49:29: [+] 3/4 new URL - http://www.domain3.com/wp-admin/post-new.php 22:49:29: [+] 3/4 new URL - http://www.domain3.com/post-url-here/ 22:49:29: [+] 3/4 submission successful (3 submitted - AVG: 671.47/h) - http://www.domain3.com/post-url-here/ 22:49:29: [+] 001 possible new Target URLs from present accounts. 22:49:31: [+] 4/4 matches engine WordPress-Selfhosted - https://www.domain4.com/wp-login.php?redirect_to=https%3A%2F%2Fwww.domain4.com%2Fwp-admin%2F&reauth=1 22:49:31: [+] 4/4 registration successful - https://www.domain4.com/wp-admin/ 22:49:31: [+] 4/4 new URL - https://www.domain4.com/wp-login.php 22:49:31: [+] 4/4 login successful - https://www.domain4.com/wp-admin/ 22:49:31: [+] 4/4 new URL - https://www.domain4.com/wp-admin/post-new.php 22:49:31: [+] 4/4 new URL - https://www.domain4.com/post-url-here.php 22:49:31: [+] 4/4 submission successful (4 submitted - AVG: 793.00/h) - https://www.domain4.com/post-url-here.php 22:49:40: [+] 1/1 matches engine WordPress-Selfhosted - https://www.domain4.com/wp-login.php?redirect_to=https%3A%2F%2Fwww.domain4.com%2Fwp-admin%2F&reauth=1 22:49:40: [+] 1/1 registration successful - https://www.domain4.com/wp-admin/ 22:49:40: [+] 1/1 new URL - https://www.domain4.com/wp-login.php 22:49:40: [+] 1/1 login successful - https://www.domain4.com/wp-admin/ 22:49:40: [+] 1/1 new URL - https://www.domain4.com/wp-admin/post-new.php 22:49:40: [+] 1/1 new URL - https://www.domain4.com/post-url-here-2.php 22:49:40: [+] 1/1 submission successful (5 submitted - AVG: 663.11/h) - https://www.domain4.com/post-url-here-2.php

Site country on Verified urls

Hi!
I think it will be nice and handy to have the country column in the Verified list of urls. If GSA can identify the country (based on domain tld and others), it will be helpful for guys who aren’t working with xxx,xxx,xxx of urls to avoid certain tlds/hosts. I know it can be done while posting, but I think it can be useful if you want to re-target certain sites/languages.
Not sure if it’s easy or not, but the data is already collected while posting (in the Last Verified).
What do you think?

Why are file urls marked as not secure while https urls marked as secure in browsers?

I understand that modern https protocols are secure enough for an average user.

However, since these protocols need to fetch files from the server while the file protocol just tells the browser to find the file in your computer, I think file protocols are more secure than https protocols.

Thus, why are browsers marking https protocols secure while marking file protocols not secure?

Configure a Google Analytics filter for blog posts when URLs are mixed with other content

I have a WordPress site with a blog section but also a lot of other content, that is not directly related to the blog. Now I want to use Google Analytics and only see the blog traffic, meaning users that click on blog posts.

I know there are ways to filter this, if the blog posts reside in a subfolder, like example.com/blog/my-blog-post, but on my site the link structure doesn’t have that. It’s like this: example.com/my-blog-post

What would be a good way to filter that traffic in Google Analytics? It should also work for any future published posts. I have the Google Tag Manager connected, maybe that might provide a viable option?