Info for Site Owners

Why Should I Add My Site to Pine.blog?

Blogs come in all shapes and sizes from newspapers, to personal blogs to twitter-like microblogs, and Pine.blog is the open search directory for it all. Adding your site to Pine.blog's growing index of blogs, microblogs, and photoblogs can help your site get more exposure and help Pine.blog users more easily discover and recommend your site. Additionally, Pine.blog directory is available to other apps via the official API, so once you add your site to Pine.blog, it will show up in any app using Pine.blog's API.

What Happens When I Add My Site?

When a user submits a link to Pine.blog's index, a request is sent to the site and its contents are crawled to determine if it's a feed, or a site. Feeds are added directly to the Pine index with a link to their parent site. A site, however, is inspected for any feeds on the given page and any feeds found this way are tied to the site.

Once a site has been inspected, Pine.blog needs to determine which feed (if multiple) is the site's main feed since, by default, when a user follows a site they're subscribed to its main feed. Pine.blog will always prefer links to Feedburner and JSON Feeds when multiple feeds are found. If no feed falls into this category, then the feed with the shortest URL is used (since this is most likely to be the home feed of the site). In all cases, feeds that are most likely comments feeds (i.e. WordPress Comments Feeds) are almost never considered the main feed.

In an effort to simplify and to prevent spam, Pine.blog populates a site's data with information found in its feeds. No attempts are made to gather information from the host site since RSS and JSON Feeds provide all the information needed.

Subscriber Counts

When Pine.blog crawls a feed on your site looking for new posts, the User-Agent header in the request will have two important pieces of information that you can use for your own analytics: the id of the feed in Pine.blog, and the number of users on Pine.blog that are subscribed to that feed. For reference, the header you'll find will look something like this:

User-Agent: Pine.blog/1.5 Feed Crawler (156 subscribers; feed-id=some-uuid; +https://pine.blog/)

Important Note: Pine.blog also requests various resources from your site when importing a site for the first time, or when a feed changes. These requests are often made to your site's home page or to any media that's linked in new posts. Requests made this way will not include any subscriber counts. Counts are only present in requests for the actual feed.

Additionally, Pine.blog tries to determine various metadata about a post from its contents and although Pine.blog does make requests to your site for linked media, they are not cached or stored by Pine.blog. Users will always request resources and media directly from your servers.

Addition Crawling Info

Please ensure the following servers are not prevented from accessing your site. Blocking these servers will cause all Pine.blog users from being able to receive updates from your site.

  • worker1.pine.blog
  • worker2.pine.blog
  • worker3.pine.blog

Standard Caching Headers are obeyed for up to 1 day. Pine.blog will automatically refetch the contents of your site at least once per day.

Payments and Contact Information

When Pine.blog crawls your feed, it may make additional requests to the pages linked to by the feed. When doing so, Pine.blog collects rel=payment and rel=contact links. Pine.blog will show icons on your site's pages and in the Pine.blog app to encourage people to contact you and support your work.

Pine.blog does not handle or process your payments in any way nor does Pine.blog handle any communications between your site and your readers. It just displays a link to any resources you've added to let your readers know how to support your work and get in touch.

Webmentions

Pine.blog sends and receives Webmentions. When a Pine.blog user likes, comments on, or replies to a post, Pine.blog will send Webmentions to the relevant site. To receive Webmentions for Pine.blog, all you need to do is ensure that your site is compatible with the Endpoint Discovery methods described in the Webmention Specification (all methods are supported).

Webmention Requests will always be made with the following User-Agent string.

Pine.blog/1.0 Webmention Discovery support@skyrocket.software

Pine.blog also receives Webmentions. You can send Webmentions to the following URL:

https://pine.blog/api/webmention

Webmention Icons and Follower Status

Pine.blog uses h-entry classes in the target page's markup to determine what icons, name, and URL to show in the content of the Webmention. Pine.blog supports replies, mentions, and likes. Reposts are shown as mentions.

Users can elect to only show the comments, mentions, or replies from sites they follow on their blog. In this case, Pine.blog uses h-entry tags (specifically the p-author and u-url tags) to determine if a given site is one that the user follows. If your site does not support h-entry classes, then it will be omitted in all cases where the user has selected this option.