Revision 1 (Komorek, Kamil, 27.11.2016 09:53) → Revision 2/5 (Komorek, Kamil, 27.11.2016 09:55)

h1. dex_chan - mass image downloader for 4chan and any other chan board

 This app lets you download all images from any 4chan-like board.
 It is a universal downloader driven by the included boards.xml file, which lets you describe any publicly accessible chan site.
 You can edit boards.xml yourself if you need support for another chan. If you delete the file, the app downloads the default one from my server.
 Auto-update of the boards file is supported.

 !http://i.imgur.com/R4ROy9l.jpg! 

 *Features:* 

 * Mass download of images, videos and other files from all topics
 * Flexible support: read below and edit the boards.xml file if your favorite chan is not supported
 * Multithreaded downloader
 * "Intelligent" queue

 Language: Polish (but it's easy to use :) )

 Have fun! 

 h2. Boards changelog 

 * 201505092200 - fixed 4chan patterns
 * 201211302330 - initial version, plus 7chan support

 *How to update?* Just delete boards.xml and restart the application.

 h2. Requirements 

  * Written in C# for .NET 3.5; may require the latest Microsoft Windows updates and redistributable packages
  * Requires an internet connection

 h2. Download 

 Current Version: 1.1 
 Latest: http://dl.dexterxx.pl/pr/dex_chan/dex_chan.exe 

 h2. More to read: 

 * http://blog.dexterxx.pl/2012/12/01/hello-grudzien-dex_chan-v1-1/ 
 * http://blog.dexterxx.pl/2015/02/16/wypiem-platforme-projektow-dexterxxprojects-bo-nie-mialem-cierpliwosci-do-drupala/ 

 h2. Boards.xml quick tutorial 

 Open @boards.xml@. Learning by example is easiest, so look at the 4chan definition:

 <pre><code class="xml"> 
 <site> 
	 <name>4chan.org</name> 
	 <!-- Below is the URL format of a board page, e.g.:
		 http://boards.4chan.org/g/1
		 http://boards.4chan.org/tv/1
		 etc.
		 (the first page may sometimes not work, but if /g/3 (the 3rd page) works, you know the format is good)
			 Both @BOARD@ and @PAGE@ must be specified. -->
	 <board_url>http://boards.4chan.org/@BOARD@/@PAGE@</board_url> 
	 <!-- Below is the URL format of a thread, e.g.:
		 http://boards.4chan.org/g/thread/57704107
		 Looks similar? g is @BOARD@ and 57704107 is @TID@, the thread id
	 ... -->
	 <board_url_topic>http://boards.4chan.org/@BOARD@/thread/@TID@</board_url_topic> 
	 <!-- Below is the URL format of the full-size image (not the thumbnail), e.g.:
		 http://i.4cdn.org/g/1480199776841.jpg
		 so g is @BOARD@ and 14801... is @FILE@. Easy?
	 -->
	 <board_url_img>http://i.4cdn.org/@BOARD@/@FILE@</board_url_img> 
	 <!-- Now the more complex part: the regular expressions below let the app recognize the parts of a board page.
		 The value to extract must be in the first capture group (); that may be a little difficult, sorry for that
	 -->
	 <patterns> 
		 <!-- Link to a topic (often the "Reply" link); it resolves to @TID@ -->
		 <reply><![CDATA[<a href=\"thread/([0-9]{2,15})/?[a-zA-Z0-9-]+?\".?(class=\"replylink\")?>Reply</a>]]></reply> 
		 <!-- Resolves to @FILE@, the file name only: 1480199776841.jpg from the example above
			 IMPORTANT: the first group must match the file name and the second group the extension
			 so: 1480199776841.jpg -> 1: (1480199776841) 2: (jpg)
			 I don't know why I did it that way in the past ;)
		 -->
		 <image><![CDATA[<a class=\"fileThumb\" href=\"//i\.4cdn\.org/[a-zA-Z0-9]+/([0-9]+)\.([a-zA-Z0-9]{2,5})\" target=\"_blank\">]]></image> 
	 </patterns> 
	 <!-- Below is the list of boards on this chan.
		 Each value is injected into the @BOARD@ variable.
		 -->
	 <boards_list> 
		 <b>a</b> 
		 <b>b</b> 
		 <b>c</b> 
		 <b>d</b> 
		 <!-- ... --> 
		 <b>vp</b> 
		 <b>wsg</b> 
		 <b>x</b> 
	 </boards_list> 
	 <!-- That's all. Easy, isn't it? -->
 </site> 
 </code></pre> 
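
 To check a new site definition before putting it into @boards.xml@, it can help to try the URL templates and patterns outside the app. The following is a minimal Python sketch, not the app's actual C# code: the helper names @expand@, @thread_ids@ and @file_names@ are hypothetical, the sample HTML is hand-written to fit the patterns rather than copied from a live page, and joining the two image groups with a dot to form @FILE@ is an assumption based on the comments in the definition above.

```python
import re

# URL templates and patterns copied from the 4chan <site> definition above.
BOARD_URL = "http://boards.4chan.org/@BOARD@/@PAGE@"
TOPIC_URL = "http://boards.4chan.org/@BOARD@/thread/@TID@"
IMG_URL = "http://i.4cdn.org/@BOARD@/@FILE@"

REPLY_PATTERN = (
    r'<a href=\"thread/([0-9]{2,15})/?[a-zA-Z0-9-]+?\".?'
    r'(class=\"replylink\")?>Reply</a>'
)
IMAGE_PATTERN = (
    r'<a class=\"fileThumb\" href=\"//i\.4cdn\.org/[a-zA-Z0-9]+/'
    r'([0-9]+)\.([a-zA-Z0-9]{2,5})\" target=\"_blank\">'
)


def expand(template, **values):
    """Replace every @NAME@ placeholder with the keyword value of that name."""
    return re.sub(r"@([A-Z]+)@", lambda m: str(values[m.group(1)]), template)


def thread_ids(html):
    """Return the first capture group (the @TID@ value) of every reply link."""
    return [m.group(1) for m in re.finditer(REPLY_PATTERN, html)]


def file_names(html):
    """Return 'name.ext' built from groups 1 and 2 of every image link."""
    return [f"{m.group(1)}.{m.group(2)}" for m in re.finditer(IMAGE_PATTERN, html)]


if __name__ == "__main__":
    # Hand-written sample markup (an assumption, not real board output).
    page = (
        '<a href="thread/57704107/some-topic" class="replylink">Reply</a>'
        '<a class="fileThumb" href="//i.4cdn.org/g/1480199776841.jpg" target="_blank">'
    )
    print(expand(BOARD_URL, BOARD="g", PAGE=3))
    for tid in thread_ids(page):
        print(expand(TOPIC_URL, BOARD="g", TID=tid))
    for name in file_names(page):
        print(expand(IMG_URL, BOARD="g", FILE=name))
```

 If a pattern returns an empty list for markup saved from the target chan, the regular expression most likely needs adjusting before the definition will work in the app.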


 h2. Changelog 

 2012-12-01 / v1.1 
 * + *Support for boards.xml!* 
 * + Version information 
 * * Better stability  
 * * Bugfixes 

 2012-06-24 / v1.0 
 * + Initial version
 * + Supports only 4chan