组织ID: |
de.l3s.boilerpipe |
项目ID: |
boilerpipe |
版本: |
1.0.4 |
最后修改时间: |
2019-06-05 11:24:18 |
包类型: |
jar |
标题: |
Boilerpipe -- Boilerplate Removal and Fulltext Extraction from HTML pages |
大小: |
71.54KB |
|
Maven引入代码: |
<dependency>
<groupId>de.l3s.boilerpipe</groupId>
<artifactId>boilerpipe</artifactId>
<version>1.0.4</version>
</dependency>
|
Gradle引入代码: |
de.l3s.boilerpipe:boilerpipe:1.0.4
|
下载Jar包: |
|
POM文件内容: |
<project xmlns="http://maven.apache.org/POM/4.0.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/maven-v4_0_0.xsd">
<modelVersion>4.0.0</modelVersion>
<groupId>de.l3s.boilerpipe</groupId>
<artifactId>boilerpipe</artifactId>
<packaging>jar</packaging>
<version>1.0.4</version>
<name>Boilerpipe -- Boilerplate Removal and Fulltext Extraction from HTML pages</name>
<distributionManagement>
<repository>
<id>java.net-m2-repository</id>
<url>java-net:/maven2-repository/trunk/repository/</url>
</repository>
</distributionManagement>
<build>
<plugins>
<!-- fake out maven and install the binary artifact -->
<plugin>
<groupId>org.jvnet.maven-antrun-extended-plugin</groupId>
<artifactId>maven-antrun-extended-plugin</artifactId>
<executions>
<execution>
<phase>package</phase>
<goals>
<goal>run</goal>
</goals>
<configuration>
<tasks>
<attachArtifact file="dist/boilerpipe-${project.version}.jar" />
</tasks>
</configuration>
</execution>
</executions>
</plugin>
</plugins>
<extensions>
<extension>
<groupId>org.jvnet.wagon-svn</groupId>
<artifactId>wagon-svn</artifactId>
<version>1.8</version>
</extension>
</extensions>
</build>
<repositories>
<repository>
<id>maven2-repository.dev.java.net</id>
<name>Java.net Repository for Maven</name>
<url>http://download.java.net/maven/2/</url>
</repository>
</repositories>
<pluginRepositories>
<pluginRepository>
<id>maven2-repository.dev.java.net</id>
<name>Java.net Repository for Maven</name>
<url>http://download.java.net/maven/2/</url>
</pluginRepository>
</pluginRepositories>
</project>
|
Jar包内容: |
META-INF/MANIFEST.MF
de.l3s.boilerpipe.BoilerpipeExtractor.class
de.l3s.boilerpipe.BoilerpipeFilter.class
de.l3s.boilerpipe.BoilerpipeInput.class
de.l3s.boilerpipe.BoilerpipeProcessingException.class
de.l3s.boilerpipe.document.TextBlock.class
de.l3s.boilerpipe.document.TextBlockLabel.class
de.l3s.boilerpipe.document.TextDocument.class
de.l3s.boilerpipe.extractors.ArticleExtractor.class
de.l3s.boilerpipe.extractors.ArticleSentencesExtractor.class
de.l3s.boilerpipe.extractors.DefaultExtractor.class
de.l3s.boilerpipe.extractors.ExtractorBase.class
de.l3s.boilerpipe.extractors.KeepEverythingExtractor.class
de.l3s.boilerpipe.extractors.KeepEverythingWithMinKWordsExtractor.class
de.l3s.boilerpipe.extractors.LargestContentExtractor.class
de.l3s.boilerpipe.extractors.NumWordsRulesExtractor.class
de.l3s.boilerpipe.filters.english.DensityRulesClassifier.class
de.l3s.boilerpipe.filters.english.HeuristicFilterBase.class
de.l3s.boilerpipe.filters.english.IgnoreBlocksAfterContentFilter.class
de.l3s.boilerpipe.filters.english.KeepLargestFulltextBlockFilter.class
de.l3s.boilerpipe.filters.english.MinFulltextWordsFilter.class
de.l3s.boilerpipe.filters.english.NumWordsRulesClassifier.class
de.l3s.boilerpipe.filters.english.TerminatingBlocksFinder.class
de.l3s.boilerpipe.filters.heuristics.BlockProximityFusion.class
de.l3s.boilerpipe.filters.heuristics.DocumentTitleMatchClassifier.class
de.l3s.boilerpipe.filters.heuristics.ExpandTitleToContentFilter.class
de.l3s.boilerpipe.filters.heuristics.KeepLargestBlockFilter.class
de.l3s.boilerpipe.filters.heuristics.SimpleBlockFusionProcessor.class
de.l3s.boilerpipe.filters.simple.BoilerplateBlockFilter.class
de.l3s.boilerpipe.filters.simple.InvertedFilter.class
de.l3s.boilerpipe.filters.simple.MarkEverythingContentFilter.class
de.l3s.boilerpipe.filters.simple.MinClauseWordsFilter.class
de.l3s.boilerpipe.filters.simple.MinWordsFilter.class
de.l3s.boilerpipe.filters.simple.SplitParagraphBlocksFilter.class
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler$1.class
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler$2.class
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler$3.class
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler$4.class
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler$Event.class
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler$TagAction.class
de.l3s.boilerpipe.sax.BoilerpipeHTMLContentHandler.class
de.l3s.boilerpipe.sax.BoilerpipeHTMLParser.class
de.l3s.boilerpipe.sax.BoilerpipeSAXInput.class
de.l3s.boilerpipe.sax.HTMLHighlighter$1.class
#内容未全部加载,请点击展开加载全部代码(NowJava.com)
|
依赖Jar: |
无
|