LDSpider
Author: s | 2025-04-24
Automatically exported from code.google.com/p/ldspider - ldspider/pom.xml at master, Free16t/ldspider
Automatically exported from code.google.com/p/ldspider - ldspider/pom.xml at master, Guohun/ldspider
GettingStartedCommandLine - ldspider/ldspider GitHub Wiki
Mailing list threads (newest first)
Date | Participants (message count where shown) | Subject | Preview

9/26/18 | Andreas Kahl, Jonathan Austin (2) | Maven Repository to use LDSpider as a Dependency in a Java project | Hello Andreas, Upon starting my honours project this year, the only place that seems to have the most ...
8/28/18 | grant....@griffithuni.edu.au, tobias...@kit.edu (2) | Creating a seeds file | Hi Grant, great to see continuing interest in LDSpider. LDSpider can follow RDF links. Hence, if ...
4/18/17 | Andrew Berezovskyi, Andreas Harth (2) | LDSpider runner script | Andrew, thanks for sharing the script! Cheers, Andreas. On 04/14/17 20:53, Andrew Berezovskyi wrote:
10/19/16 | Danila Feitosa | Quality of linked dataset | Dear researcher / professional, You are being invited to participate in a survey that evaluates the ...
5/27/15 | Pradeep Kumar, Andreas Harth (2) | crawl from local server | Sure, if the data is available via HTTP URIs (localhost, I presume). There should be no difference ...
5/22/15 | Pradeep Kumar, Andreas Harth (4) | LDspider on hadoop | Hi Pradeep, On 2015-05-22 14:36, Pradeep Kumar wrote: > Does this mean we can't run ldspider on ...
5/15/15 | Ran, Andreas Harth (2) | "Hook previously registered" Problem | Hi Ran, do you try to re-use the objects? Could you do "new" when you use the objects a ...
4/28/15 | Samita Chanchal | jar file of LDSpider? | Where can I find jar file of LDSpider?
4/22/15 | wenqiang liu, Alberto Trindade Tavares (3) | How can I get the source code of LDSpider? | Thank you very much. But the source code can't run in Eclipse with bug that the bad version ...
1/8/15 | Andreas Harth | Re: Inquire about SWSE System | Hi, I've cc'ed the mailing list as other people might be interested as well. On 2015-01-08 03 ...
2/24/14 | Andreas Harth, Valentino Hudhra (2) | Meaning of Mode.ABOX_AND_TBOX_EXTRAROUND? | Hi Andreas, did you get an answer for this? I would be quite interested to know. Cheers, Valentino On ...
2/6/14 | Valentino Hudhra, Andreas Harth (2) | Crawling Tbox - Seeds | Hi Valentino, On 02/06/2014 07:33 PM, Valentino Hudhra wrote: > first, kudos for the tbox_only ...
9/7/13 | albertonm81 | Some questions | Dear all, I have developed a little program in order to download some datasets in RDF. Here is some ...
7/26/13 | Thang Chi Duong | Problem with seed list | Hello, I try using LDSpider with the following seeds: http:/ ...
7/15/13 | Andreas Harth, Tobias Käfer (4) | RDFXMLParser emit ASCII N-Triples | Hi Andreas, seems like you are using an outdated version of NxParser for your crawling. In r110 (Aug ...
6/18/13 | Altruist, Andreas Harth (2) | Make LDSpider behave like a non linked data spider. | Hi, On 18/06/13 02:41, Altruist wrote: > Is it possible for the LDSpider to behave like regular ...
6/3/13 | Altruist, … Andreas Harth (3) | Any idea about why I get this exception? | Hi, the server does not respond quickly enough, leading to a timeout. AFAIK there are command line ...
5/15/13 | Florian Kleedorfer, … Jürgen Umbrich (9) | Frequent crawling with same seed - download only delta? | On Friday, May 10, 2013 7:00:51 PM UTC-3, juum wrote: great that it is of interest. It would be very ...
5/9/13 | albertonm81, Andreas Harth (4) | Starting with LDSpider | Thanks for the information about Frontiers, but maybe I didn't explain myself correctly. What I ...
5/7/13 | Ron Koron, … Florian Kleedorfer (10) | Writing to Triple Store | Hi, I think I found a solution: It seems Virtuoso doesn't like 'INSERT DATA INTO [graph] ..
3/22/13 | Umutcan Şimşek, Jürgen Umbrich (2) | specifying predicate filter | Hi, sorry for the late response. The wiki page on the Google project page should contain the necessary ...
2/17/13 | Altruist, Andreas Harth (6) | LDSPider Warning when using with Fuseki Server | Andreas, This is how the deprecated URIs are being generated. CREATE+SILENT+GRAPH+%3Chttp%3A%2F% ...
2/14/13 | Altruist | DEprecated SPARQL 1.1 INSERT Queries. | Hi All, The current latest version of LDSpider creates deprecated SPARQL 1.1 INSERT Queries. Are there ...
2/2/13 | Altruist, Andreas Harth (5) | Exception while parsing a robots.txt | Thanks a lot Andreas though I will make that change thanks. So is this an issue with Norbert code is ...
1/21/13 | Altruist | Iteratively call the crawler. | Hello All, I need to crawl a large set of URLs that are stored in a file and since adding all the URLs ...
1/10/13 | Altruist | LDSpider and robots.txt | Hi All, I have noticed that when LDSpider is given a URL to follow it apparently reads the robots.txt ...
1/9/13 | Altruist | Time Out Exceptions | Hello Folks, Any idea why I get the following exceptions and how I could avoid them? Jan 09, 2013 8: ...
1/8/13 | Altruist, Andreas Harth (3) | How to crawl an entire website | Thank you Andreas, Assuming that I only need to crawl bbc.co.uk my seeds file would only contain a ...
12/23/12 | Jaakko Lappalainen, … Jürgen Umbrich (34) | Error when performing crawling | You are absolutely right, there were two missing inclusions :) But still the same problem, I'll ...
12/9/12 | Jaakko Lappalainen, Jürgen Umbrich (3) | Some issues on this project | Thanks for the quick reply, I'll explain further: on Issue 1) This is the implementation of the ...